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Description 



TECHNICAL FIELD 



[0001] The present invention relates aeneraiiv to m^K^t. , 

nucleic acid molecules, and more ^k^^^^^^T 6 ** detw ™"9 sequence of 
nucleic acid sequences simultaneous^ conposrtions whrch allow the determination of multiple 



BACKGROUND OF THE INVENTION 



40 



45 



50 



55 



10002J Deaxyribonuclec aad (DNA) sequencina is ana nf tho hoci^ ««^~ 

ular biology and plays a rapidly avanL^oW^"^^^ etec^esrtteotogy. It is at the heart of molec- 
eflbrt to read the entire human g^oSTh The Human Genome Project is a multi-national 

to have a major impact on nJ^T^^^T^ ^ EST" ^ haS 
»s success of this project Indeed, a substantial eSoThas b2nTS3l H ^'^S?**™* 0 * wi " the 

ks ot jsssrrrs sir r ssbksks a~ r* ™ - - — 
ss -e :£3^ JsrriK % straps." sffl * in — - — 

oxy-terminators. involves the synthesis of a DfSTtran^t««f » 2*2^.2?^ 74 5463 ' 1977 > which dide- 
» Sanger method of sequencing depends on SJiJE SiZ^S^^^ ' ^ 7I » 
grow,ng strand in the same way as normal deoxynudeotideT L!5t2?£!^? . (ddNTPs) are incorporated into the 
normal deoxynucleotides (dNTPs) in that theTSel^H Soun nZ£Z t^'' ddNTPs di «* fr °™ 

incorporated into the DNA chain, me absence Seff^Z^S ? f*" 1 Ration. When a ddNTP is 
bond and the D NA fragment is terminated ^^ZZe^Z^^ * a nw P«*sphodiester 

' and Gilbert method (Maxam and Glbert. Proc. NM Axd^ M^JlS ^ *. temp,ate DNA - 706 *«■»•" 
method of the original DNA (in both cases the DWnSKttaSS Bo^S^n!!^ * < * 8micaI degradation 
that begin from a particular point and terminate in wXe^^^^r** 8 Populations of fragments 
The termination of each fragment is dependent on tile 522^-^1 "i J fragment is to be sequenced. 
The DNA fragments are separated byi^w\Z«^T*,™Zl particular base wrthin the original DNA fragment 
cytosine. thymine, guanine a.so UncZZ SSlS ! °* ^ ° NA 63868 (adenine. 
[0005] A cumbersome DNA pooling BB^nS^SZ, ^T^^^ 09 ^ 01 9el - 
one of the more recent approaches to DNA^Sr^^ 24:1 1988) is 

DNA templates (samples) and processing ^Tsan^S^^^^^ ^^ * a number of 

end of the processing, the DNA molecules of intra? J Z r epmt ° the sequence information at the 
tagged DNA molecules are pooled, emptied IS"?"* * the beginning. The 

pooled samples, the DNA is transferred tTa ZTs^n^Z tZ^'"*" After electrophoresis of the 
labeled oligonucleotides These merrfcranes IrTmJ^lT , hybndizsd *»» a sequential series of specif to 
ducing. in each set of probing. m*!S^£g££Z 3££?2ZT "* ^ ^ ^ ^ 

reaction and gel yields a quantity of dataeq* valent to Biat obTatne^ro^^ sequenang methods. Thus each 
the number of probes used. If alkaline phosphatoseVus^as ml T*"* and 96,8 *Y 

used which is detected in a chem« U minescenuS,y ^c^Hol^r ^ ^ 1 Z ^° Xetane substrate can be 
the sequences can only be read by Southern S^to^Z'^^^^^ disadvantage is that 
clone in the pool. 9 9 set " ,enan 9 & and hybridizing this membrane once for each 

[0006] In addition to advances in sequencina msthnrini™.^ 

of automated DNA sequencing. Brief^e^etn^s^oTe^Tr ll? ° CCU " ed due to adv «"t 

employed radio.abe.ed components. £u>resce*7yes ar at^e^£££5? ***** rep,aCe mettlods «•** 

minators. Robotic components now utilize polymerase Sain EEJSS? ££ZTZ IS"" °' ddNTP - t6r - 
ment of linear amplication strategies. Current commercial ^ ^ ( ° th<5 

run on a single lane. Each dideoxy-terminator reaction is J«S * dtdeoxy-terminator reactions to be 

for each base type. A.T.C.G). Only one ter^pla SSST(7J DNA ^L ^ ,,UOrescent (on. f luorophore 
the simultaneous electrophoresis of up to 64 sarrSes h m 2^!^ } ,8 n represented P er «ane. Current gels permit 
detected by the irradiation of the gel laWHg^^ "agmerS are 

trophoresis step is about 4^ hours long Each ^ectmntwS^ t 9ht tne fluor °Phore. Each elec- 

ong. taC h electrophoress separation resolves about 400-600 nucleotides (nt) 
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therefore, about 6000 nt can be sequenced per hour per sequencer. 
[0007] The use of mass spectrometry fnr th A c*.^.. ~* 

described (Hignite. In BiochenL, A^ZZs^fj^^^^ of nucleic acids has abo been 
Chapter 16, p. 527. 1972,. Briefly. •^SSS^E^^S^ W '^^ence. 
protected synthetic oligonucleotides up to 14 bases k,™? T^fl * !^ ^ °° ta,ned Dlasma Resorption for 
proteins, the applicability of ESI-MS Sl^SSStZSZ SE2£^ * * 4 . b — h ^ ^h 
Spec 2:249-256. 1988). These species are iort«^n^JSoTl2T^ (C<5Vey * ^ Co ^- Mass 
phodiester and/ or terminal phosS^eT^l^^ ?° Char9e resk,ing at acidic bridging phos- 
tion to sodium adducta ' * the 938 phase "^P* barged molecular anions in acS- 
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is for larger dTa template* ^ ^S^^JS^^^^ * ™ Seated than it 
tion method requires about 50 pmol of ^ **omposi- 

tton. and film exposure For small oligon^eotiL (<i7S, S^mb^ *"* electro P h °retic separa- 

Fourier transform (FT) mass spectrometry (STi fast^ a n^ dectrospray ionization (ESQ and 

charged ions measured at high (10 s ) reS^nl? r«^J^! .! ^ Dissociation Products of multiply- 
sequence in less than one ^onlb^pfo^^ P-^ng theM, 

For molecular weight measurements. ESI/MS has been «t«k^ iT «! ' Chem - ^ ' ^ 4893. 1994). 
22:3895. 1994). ESI/FTMS appeal to be a vatawHo™^ f ^""i fragments * al.. A/uc. Addis *es. 

mutations in nucleotides as large as 100 ^ZZSSZJSS? l^T™ ™*° <i * *" 8 ^cing and pinpoint 
mer using a more sensitive ES° ^Z^l^^^S ^ ^ 3 * " - *S 

Uis ot^rTen^^^^ one ir .which DNA * labeled w*h individual 

of DNA have been separated by electropho^^ after a -^ures of sizes 

the mass spectrometer to both seoaratea^ rfJLl Z n?^ ^° a< * descrlbed above utilizes the resolving power of 
at best.) AO of the procedur^deS <* <■"""• a d» icuH ££S * 

of DNA fragments that vary in ^Z^r^e^ ^^^^ 3 primerWsenS 

the original primer, a replicated ^ I^SESJ^S^ T m °' eCU,e8 63Ch ^ 

DNA molecules is produced that contain the primer and rfflT in i^STS ^ ** termin a*or. That is. a set of 

[00101 Brenner, et al. (Bio,. MaTSZ ETSfSSE TS^SZ ^1°^ * 003 
stable isotopes of sulfur as DNA labels « i^SST 1 ^^ to U8e fo " 

electrophoresis. Using the a-thio analogues of the ddNTPs ? ? ™£! have ^ se P arated by capillary 

fragments. Therefore each of the fouT NpeTof DNaTL^T £££ !^ " ""^ated into each of tfSoNA 
uniquely .abated according to tine termin^^e S^SS^L ^^nated) can be 

and 36 S for T. and mixed together for electrophoreses SunTELl^ 9 * S ** Q - ** S «* C. 

ink-jet printer head, and ten subjected to^S^^^l . ° ,a,ew p,co(iters are obtained by a modified 
of the labeled DNA to SO a . wn^^e^to^anrtSfe^t^n^l^ 8 " * "» ^^ospha S 

SO a mass unit representation is 64 for fragments e^iro in A SSl?^ "**" 01388 ^octromeler. The 
olution of the DNA fragmerrts as they eme^^^^^^ 3 ^^ Maintenance of toe res- 
the mass spectrometer is coupled directly to the caoill^v ^<*^?i on **n? suffKaentty small fractions. Because 
e.eorophores.s. This process is urtibrtunate^^^ ^"2? iS de,ermined «V the rate of 

Two other basic constraints also operate on this aoproaS- affS l£2 ~ 9 has not been commercialized, 
baric contaminants, can be tolerated and «hWh-^ , '! 11 . ?** COmponents ™* mass of 64, 65, 66 oresns^ 

be enriched to >99% to eliminate contaminating 32 S Tsotope^hat Jr/^^^^ f^ 3 "*' Qther isoto P^ must 
[001 1 J Clbert has described an automated DNA sequencer (ERA 921 08678 a SL 

thesizer. an array on a membrane a detector whirh H^^l J T* 92108678 2 > •« consists of an oligomer syn- 
thesizes and labels multiple oHgomef oT^C p ^^er^^ V"™ ^ ^osizer sy^ 
DNA on membranes The detector identifies l*S2SES£S anTthe^ USST" ^ ^ to ^ imm °b"'«d 
wh,ch constructs a sequence and then predicte the ^S£S?3iI^ - ^ th ° Se P 3 "^" 5 to a central computer 
iterative process, a DNA sequence can be otJSEES^ST* ° « an 

" based on ligation of oligomers (U.S. Patent 

Plate. A primer is hybridized to a Sna template ano^enl £5 JSSS *^!** kb « to a nuclac acid tem- 
to the primed template in the presence ii^nSEI^SS^^ iS a,SO hybridized 
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information concerning the nucleotide residue in the nucleic a «H t^i*- ^ ^ 

base paired. ™ JClac aad tem P' a t« which the labeled nucleotide residue is 

[001 3] Koster has described an method for ieru ,onri™ oma . 

sxonudsas. (POT/US**^. ^^d^T^^'SS^^ d ^ ,lOT ' J,0NAb >'»' 
Sangs, -sacta, is not used). ONA is donsd i«SS«,S SSTJ * e,:,,, dMsrmlned <•» 



[0014] 

U»SS^^ A * OCsl 

is based on the concent of h^iHw tnn ^ 



The demand for sequencing inlbrma^n * targer tr«n caTb^SS^L . _ 
such as the ABI377 and the Pharmacia ai f nnl^f supped by the currently existing sequencing 
small number of tags which can be resd^Sna <* 9,9 «•«"« technology is the 

K lecS S ISS^TS^^^ ^ * "~ to sequence r^e* 

other related advents * sensrtivrty than the methods described above, and furih~ ^ 



other related advantages. ™„ y man tne methods described above, and further provides 

SUMMARY OF THE INVENTION 

20 

» particular nucleotide and det^able^^^ m0,eCU ' e ' Wnerein a *0 « Relative with a 

ments by sequential length, (c, cleaving me IZ^T^TJa^T^T^ W Separath * tagged frag 
centspectrometry or potentiometry. and i-JLSSSSSS SZl^,!?*^ t30S * "on-fluorel- 
embod.ments. the tags are detected by mass spectrometin hSr^ti I S^^l? " UC,e f aad molecule. In preferred 
static amperametry. spectrometry, infrared spectrometry, ultraviolet spectrometry or potentio- 

30 100181 h anolher aspect « he ^en«on provides a compound of the formula: 
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ZSZi optioTSs SSSSSS^TSS? ST r?*" 3t ,6aSt °" e - M-gen and flu- 

allows a ^-containing moiety to t^SS^SS^ rtT™^ ^ * *" °^oup^5, 
comprises a functional group which supports a single ZSil^JSS^J!^" ^^ning moiety 
spectrometry and is selected from tertiary amine ouat^rv anlnf !^ the com P° und * subjected to mass 
from hydroxy., amino, thiol, carboy acid. ^^ ^l^LTer^^^ elected 
of the group toward coupling with other moieties, or is a nuc!eic acri fr^o™ « f ^ ^ ° r the activity 

KUtr ^™an~^ 8 *-* - compounds of »e fo rmu* 

hydrogen and fluoride, and optional ato^TeleSSTom "P**"™** comprising carbon, at least one of 
cranio group which allows a ^-con^^^^ P^orus and iodine; L is an 

T™-containing moiety comprises a functional groupwhiS f su^oS aTJ£ °' coni P°""cl. wherein the 

subjected to mass spectrometry and is se.ecLton ttSS^^ZZT" ** ^ com P° u ^ « 

nudeic acid fragment wherein L is conjugated toihe ^MOU, i^SETJK * ^ ^ ° r9artc *** MOI is a 

two compounds have either the same T^ or the same mSi 8 3 ^ * MO,; and no 

[0020] in another aspect, the invention Drovides a mm™,*;* 

T--L-MOI. wherein. T™ is an organic SSZL^ 8 C ° mp ° Und - » a '—ula 

hydrogen and f.uoride. and optional atoms selerttdfrom ^ com P risin 9 carbon, at .east one of 

organic group which aHows a T»»-contain^ S3? fe ^i^tSSST; ™^ a "<* 'odine; L is an 

T™-containing moiety comprises a functional g^which suSo^ . J?*! °* ConipoiJnd . herein the 

^^^^^^ 
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mr. pnospnon* and iodine: L is a^Saf^SSf^???^ """^ ™ y9ea *"<« 
remainder of are conpound. wnerein *^J£ZLZ!t£? ^-«**»r.n, mo„ty to be cleaved ton tne 

ionized cnan,. aate'Xn i. ceJ^LSZ ^.SE^EffiSEES * 
ternary amine and organic acid* MOl is a m,^^ =„-.w ^ . ' J 15 se,ecTea "° m tertiary amine, qua- 



org. nio 9ro op Mh a,to«, a ASSESS? !E3SJ5E1^!E*^ * - 

T™-corrtaining moiety comprises a function^ nrr- ~ -_T_L ■ . remajndef of the compound, wherein the 

is subjected to massspec^me^ SeSntt^ 3 S,n9,e ™ ized "hen .he compound 

nucleic acid fragment SS^oSiSd^S. T^Z'^T™* -nd «** MOIfea 

members within the second pl£«Sy S Tan SS p^ h »*" *e 3' end of the MOl; and wherein all 
selected from ddAMP. ddGMP ddSSp^S JT^* 8 the «™ °Weoxynudeotide 

pounds of the first p.urality is r^^ZZZ^Z^ n!Z^ 2 ° Mea *>™*«™° P^ent in the com- 
10023, '"^0^*0^^^ 

of container sets, each container set comprisino at least f K/« Jj Tu 9 , ,ys ' a T " 8 W comprises a plurality 

second. thirf. fourth and fifth corrtaineJcTn^ a vector, a 

detectable by mass spectrometry, comprising cart^Tat least onTS hZ^lZ^^"™"^*** 
selected from oxygen, nitrogen, sulfur phosphomsai Lnt , ™! "V*"* 6 " and fluonde - and optional atoms 
moiety to be cleaved from Z reT^Iil^ 8 
group which supports a single ionized charoe stata^hJ.' Tf* ^ -containing moiety comprises a functional 
selected from IZry amine qua" nSyt^ne aS oraaTc ^LTr^ '* 10 171888 *>«*°™try and is 

jugated to the MOl at a location other ITyrtXlSf^S ^ " C 8dd frasment wherein L * con- 
containers is identical and comptem*^ to a oSion S SS stf *at the MOl for the second, third, fourth and fifth 
within each container is dHI^CS^^^*^ °» Set * — ** T- group 

SR. sy^nS^a^ ~[ *™ * 8 nUd8ic 8C * 

cleaves from a taggednucleic S^e^a^o^^^lf T^SL™*"* *** « a *™«*> ™ apparatus that 

ssr^-^ ^ an -^iS^^S a p " ia * r nudeot * e 31x1 detectaue * 

So. ^Tgr^iT^d^ 100.200.250. 
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** *" SyraheS * - — esters of a set of 36 pnoexnen*,*, 
,k,,,,:,W, * *" Sy * KiS 01 3 861 °' 36 — *— « PN»=chen*a^ cteavaue mass 
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acids. 



spectroscopy tagged oligonucleotides 

spectroscopy tags. 
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Rgure 11 Olustrates the ^e^^T^^^^ 
Figure 13 depicts a modularly-constructed tagged nucleic acid fragment 
DETAILED DESCRIPTION OF THE INVENTION 

as harin, m, aOT «a, fcrn^a: '"""OK"** !»"*>«> a tag • T>iu*co nwU ndsollhai, w ^„ ByB , ~£ 
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^"^"^ - «-■— -r m. b. -.jssssss^astss aEr *• io * ,M 10 t - 

T-L-MOI and T-L-L^ 

■ 

[0029] For reasons described in detail holm* t 1 

tat cause the .abi.e bond(s) to breaj Z retTs^ ImoS £37* ISf ST*** ****** *> «™*on. 
•> then characterized by one or more analytical technfauLTo 3eS Z 'TZ * *° COmpound " ™° tag moiety 
the tag moiety, and (most importantly) U^^^^J^fT^^^ "formation about the structure* 
[0030J As a simple illustrative exarrnle of am^lTJ. ^ 01 desponding MOI. 

reference is made to the following structure (i): compound of the invention wherein L is a direct bond. 



30 




^(Nucleic Acid Fragrant) 
35 ^ I H 

Liricer (L) cornponent 
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Tag component \ M , 

*^ Molecule of Interest 

component 
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In structure (i). T is a nitrogen-containing Dolvcvdic arnm**.> i , 

cificaBy a nucleic acid fragment termina^K JS^SS^^t .1!?^ grOU » X is a M <* 
amide bond is labile relative to the bonds fn "because Z^SS^X^T 30 ™« 

cleaved (broken) by add or base conditions whfchTeave Z bSSTSlT J?. * b0nd ta *"»*aflJf 

moiety (i.e.. the cleavage product that contains T> C ^3?^ 1^ Unchan "*- ^ a 4 



55 



6 



EP 0 992 511 A1 



w 



15 



SO 




N ^(Nucleic Acid Fragment) 
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H 



acid or base 



OH R,N ^ (N * Wb Aca Figment) 



Tag Moiety 



Remainder of the Conpound 



25 



30 



35 



S true ttre (S) 
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Fragment) 
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MOI 



It is well-known that compounds havino an orfhn r,^^^ 

pho.olylical.y unstable, ^Lt expZ" J sl ^^r^ b ° X6d atoms ■««" structure (ii), are 
selective cleavage of the beruylaTne bond (st SS2L?J2 n^f r Waveten 9 th «■ 

the same T and MO. groups as structure (i). ho^e^theT^ ^ " f (B)) Thus ' structure ('0 has 
there is a particularly labile bond. Photolys s *ZZZ££!ZZZ? atoms and ^ w«Wn which 

remainder of the compound, as shown bSZ 1 ° ^ re ' GaSeS 9 tag ^containing moiety) from the 
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Structure (ii) 
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(Nucleic acid 
Fragment) 



Tag Moiety 



Remahder of the Conpound 



25 



30 



35 



40 



[0032] The invention thus provides comocunciR whir*, , ^ n 

a cteavage reaction so as to release a mT^eTfron^r^l^ SU 7i° appr ° prlate <**™V* conditions, undergo 
may be described in terms of the tag rrX? 7e^(^ D ^Z °f the compound. Compounds of the invention 
two groups together. Alternatively, the ZSu^s^ ^^11 k J Lh \f nd ^ ,abi,e bond < s > join the 
which they are formed. Thus, the TcompSnS n£y bl SlZSE. TJr^ ^ tm * ** components from 
tart and a MOI reactant as follows. * descnbed as *™ taction product of a tag reactant. a linker reac- 

[0033] The tag reactant consists of a chemical hanriio rr ^ _ 

is seen to have the general structure: ™ ^ 3 vanab,e component (T.J. so that the tag reactant 

T vc"Th 

To illustrate this nomenclature, reference mav be man* *r> ^ .~*. /-* ^- 

to prepare the compound of stiucture m T^tao T ( " ,, • M ShOWS a ta 9 reactan » may be used 

a tag handle, as shown betow: * e *™ n3Vin9 Structure « contains a tag variable 

component and 
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Structure (iii) 




Tag Variable 
Component 



Tag 
Handle 
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to 



EeactT^ ~" — fcr reac** the *g reactant w«h the 

active state, so it is ready - to£3j3? Sr head SS^SE?" ^ is in a 

.he r , : A a. arsrr a ^ - T * 

co^undstc^^el^ JSSlSM " ^ h sets of 

HMM menOers mW be <W*fr^^^ « « ■» 

We component of structure (Hi) may be one member of the LkJng seT^^n^S * ^2 ' ? ^ 
guished by their U V or mass spectra: 9 sei wnere men * ef5 of the set may be dstirt- 
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[0036] Likewise, the linker reactant mav ba deen-ihaH in .„ ^, - . . . 

least two. each of which may be designated as uT^TnTnu??? * J*""** handles (there are necessarily at 
ponent consist of the required fab^e^^ *• ^ com- 

eties effectively serve to^eparateS KM«S/? S^TtTA ^ L * ^ *• ° ptional lab "° 
w^in the linker ,a*,e component. Thus. ^nT ~ 3 — **« 
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Structure (iv) 
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linker reactant may have an L* grau^tfS ca« T^S^ ^ 'J*'" 0 * StmCtUre ^ a,so " ,usfra "* ** a 
reactarns may have an L 1 group buTo* 2 T L W .TSE^JJS^T h *"* Ukewis f- 
groups. In structure (iv). the presence of the orou^P-n^ to ^! . P TOy have neither 01 L *» L 3 

protected from reaction. Given this ^^L^ ^^^^^''^ that *• group is 

with the amine group of the linker reactant fr v) to far* fSlSSlSS lS^? 9 TO may c,ean,y react 

r0O3 9J The MO, reactant , a su^y ^r^^^ fe . 
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nudeic acid fragment, a suitable MOl wanant ;. , . 

Phodiester group and then to an^e^n ti^^T/* ^ ^ te * ^ ^up to a phos- 
with the carbony. group of structure!). mTfcllT^Z'^ZT^^ ™ S **» ^ then react 
quentfy activating the carbony! group tc^ard reactu^ w,i nTf 9 and Preferably after subs " 

* [0040] When viewed in a 10 **" M °« 10 

handle and a tag variable conpon^? a t^ a ^^^ ■ «* octant (having a chemical tag 
and 0-2 optional labile moieties)and a MO rea^mS^ 9 , ,W l ch « TI,cal ,ink * handles, a required labile mS 
of interest handle, to form T-l'mo, ^ to t m ^? 7^ ^"** ^chemical ™^ 

reacted together to provide T-L-Lj,, and then tt»Sor ££L TZZZSZST ^ ^ reactarrt are fir* 
,0 ( ,ess Preferably, the linker reactant and the MoTreacfar^^!^^ . 50 88 to provide T - LM 0'. or else 
MO. is reacted with the tag reactant to prov^T-L^F^S^^ to provk,e M--MOI. and then £Z- 
T-L-MOI will be described in terms of the teg rea^'^J^^ convenienc * «Wunds having the formuto 
fonrn such compounds. Of course, the JISSSSSZ ^ "** ™> °° «ES 

atoms from the .inker handle that was used^JeCreac^to^T *" * 9 h>nd * SOma * - oT^ 
this group was present in T-L-MOI and will n irh«™T,. Z octant to the inker reactant the optional labile moietv Jif 
*> the precise structure of L 2 ^ ^Tj^Z^^^tf °" K£5* on 

to as the T-containing moiety because T w?^^^^ converten ^ the tag moiety maybe reSrS 
[00421 Given mis introduction to one asS^^^"^ ?" ^ * "-"» * 
descnbed in detail. This description begins wrthfce followT™ 22 Z ^ Var '° U8 "Wents T, L and X wOlbe 
in describing T L and X. fe,,OW,na of certain terms, which will be used hereinafter^ 

2S [0043] As used herein, the term "nucleic arw tra***—* 

target nucleic acid molecule (i.e.. comptemen,^ ? K * ^ l m °' e ? J,e which is complementary to a selected 
thetically or recombinant* ProducedJncluS^^ ^ TOy be derived nature or syn^ 

stranded form where appropriate; and includes an ^Zon^cTeL^^n^Jf ' ** 09 ln doub,a or single 
analogfe.g.. PNA). an oligonucleotide whichls e^eS^??!.^^- " ^ a P"' 1 ™- a probe, a ructeicIS 



anatog(e.g.. PNA, an^onucreo^e wn^ Z^ ^J^T 3 ^ a P^eT^cTeic acS 
cteaved chemically or enzymatically. a nucleic aSSls ermfnated w^rtH^f"™*^ a nuc,eic a <* *** is 
end wrth a compound that prevents polymerization « 5 o?^^?, ^ erm,nato ' or at the 3' or y 

of a nucleic acid fragment to a selected target nucleic aC H Z^J.?' C ° mb,natons hereof The complementarity 
70% specific base pairing throughout SSSliSS? *" ^easTaooS 

about 80% specific base pairing; and most preferably 1 2SZLT2T£ w S" "** exhfcite a * '^st 

35 and thus the percent specif ic base pairing, are wett ^ in ^^oH ^ dete ™™9 «"e percent mismaS 
unction of Je Tm when referenced to thefolly base ESSSt" b ~ d ^ percert m '^tch as a 

[0044] As used herein, the term "alky. - alone or in rn^TL _ 

chain hydrocarbon radical containing fronV; to™ ZI^^TlTI* 3 Saturated - straight-chain or branched- 
Examples of such radicals include, out are n£ I mite? o^„TJh5 ^ ^ Preferab,y from 1 to 4 - <arbon^toms 
« tert-butyl. pentyl. iso-amyl. hexyl. decy. aS the ^ ^^^JT**- iS °- pr ° Py »- ""^ is o-outyf. sec-S? 

ssr* Examp,es - diradica,s - « 2 «^ a 5K7scs?5 

• Pnoat, e rnTce"rtnt^ 

2 to 4. carbon atoms. Examples of such radicals incluS tlZ not 1^ 'T 2 *° 6 and ™' a P^erably from 
a "d 2-butenyl. E- and 2-isobutenyl E- and Z r3ir^ h ,ed ,0, ethenyl ' E " and 2-prcpenyl. isoprooenvl 

sti.ig W -ch^ The term "alkenylene^e^a 

2 to 10. preferably from 2 to 6 and more preferably fromTto 4 <^nS^on;^" Ca ? 0n dOUb,e bond in a tota ' rt <™ 

[0046] The term "alkynyl." alone or in combination refers o ^li^!.! ^ ^ ^ the ,ike - 
having at least one carbon-carbon triple bond in a to^i Sta^a JTS" "f H," branched ' cha '" hydrocarbon radical 
2 to 4. carbon atoms. Examples of such radicLls inSude^ S-S ^ ? m 2 to 6 and more from 

gyl). butyny.. hexynyt. decyny. and the like. The te m TaLvTene" 7JnT„ ' lacet **"*)- P'opyny. Jopar- 

branched-chain hydrocarbon diradical having at \IZ onecln^n-c^ ^ C r* ,nation - refa rs to a straight-chain or 

100471 " ^ "~" — - " — -» '» • — ~™ =, ca^on aBms 
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which number from 3 to 8 and Drefa-ahiv frr™ n» e 

[0049] tv^ * — 



carbocyclic 



* — ^ - — -^M.. y ui yiienyi, napnmyi, indenvl inrJanvi aT,,!^ ' — » — ' WW,M «»«- yuup selected 

'° a™ 8 * ^"P from the gn^^tt^;"^"^ «"°™* a^anlhraceny.; or a heterocyclic 

zolyl. 2-pyrazolinyJ. pyrazoMinyl. isoxazolyl. .sothiazolyl i yT«S£?' 2T£ ^^'^ ""W imidazolyl. pyra- 
dazinyl. pyrimidinyt. pyrazinyl. i. 3 , 5-triazfcwt T? sit^^? 01 * 2> 3 -**»«* 1. 3. 4-tniadiazoM ovri- 
•"""W^^S^^ ^"doly,. 3H-indo^ 0 .^ 

qujwlizlnyl. quinolinyl. isoqu.no.iny,. ^SSoSS ^'7^^. ber^azo^ piirinyl. 4^ 

^SJ^Ct^^ to - — .ents w«ch are 

omethoxy. alkyi. alkenyi. alkynyf, cyano. caiSxy nrtro ' **'uoromethyl. trifluor- 

alkytemino, alkenyiamino, alkynylamino. afc5^^3?ZJ ,2 £? , * Man * aJKoxy. alkenoxy oT^lkynoX 
« pholinocartoonwamino. thicmorpr^i/noca^rtS^ N Tf n LnH ^"^^amino. a.ky.su^amino^S 
yurea^-hydroxylurea; N-alkenyforea; ^^.^^JrZ^^^^^ ^-^^^Z- 
alkyl)hydrazino; ^--substituted suWorylheterocydW-araZ S^ZSST** tnKJa n^*ubstituted aryl; N.N-{aryt 
tuted heterocycfy.; cyctoalkyMused ary.; ary^^^!^^^^: <**>■■* and cyckakenyi-sS 
amonyl; aliphatic or aromatic acyi Jbstrt^a,^ J l^^^ *™ 01 aSpha * B " aromatic acyTamSc- 
* aliphatic or aromatic acyi-substituted acy.; cydoaS^^.^ ^^^Voxy; Ar". Ar-^substiSrted^. 
phosphorodiamidyl acid or ester; ^ cl «lkyicarbonylalkyl. cycloalkyt-substrtuted amino; aryloxycarbonytai^-' 



JO 

[0052] 



[0053] The term alkenoxy alone nr in ^ «^ , 

^ is as defied abovTp^°^ «5R n^enTS S T ■» term 

"aikynyr « as detined ESSoSXZ vnci'Sef ? a,kyny '-°-' "•-*» •» term 

EE? ^-T 00t " mi,ed t0 - P«W>*y. 2-buW2y anTthTiikT * SUftabte ^kynoxy radicals 

53 £ = ^^^VX^^T — * *° — abce 

a radical of formula alkyt-NH- „ (^^J^SEEX W di ^ sijbst ^ amino radical !/ e 
aikylamino radicais include, but are not limlS T ££Z?lZ2 * 
« butylam.no. N.N-diethylamino and the like methylam.no. ethylam.no. propylamino. isopropylamnino. t- 

SSL ~ m ^^^:z%zz s^:—- - — * — *«*. « ,^ N . 

"ylamino radicate * ih. alMairt™ radical " ' aa,Ca, " noUn enm *«- An exantfa of m Ska-' 

[0058] The term "alkynylamino " alone or in ™™u: * 

» wherein the term "aikynyJ-Ts as defined I ^ ZSZ^ 0 " " -W* °r (a«kyny0 2 N- 

nylamino radicais is the propargyt amino S. 4,18 ^'^ ,S not an gamine. An example of^X- 

[0059] The term "amide" refers to either -NfRh Ci n\ m rxx i 

hydrogen as well as other groups. The term "substituled = amide r - 2££!S, K"*"* * * d f ned herein *> '"elude 

" 5 S ^oTar^adiS:^ « ~ ^ ** ^ * - — 

[0061] The term "arylamino." atone or in l£SE£ T^STStXtT* and * e ,ike 

----^esofar^inora^s^ 
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. 3- and 4-pyrkJylamino and the like. 

[0062] The term "aryl-fused cyctoalkyl." atone or in combination, refers to a cydoalkyl radical which share* h*n 
[0069] The term TiaJogen" means fluorine, chlorine, bromine and iodine 

[0071J The term "hydrocarbon diradicaJ" refers to an arrangement of carbon and hvdrooan ^s^. , . 

hydrogen atoms inorderto be an independent stable molecule /rhus. a hydnS^rioChS SjncfJS 
on one or two carbon atoms, through which the hydrocarbon radical may be bonded to othe^SnS^lS^TSS 
s nylene. alkynylene. cycloalkylene. etc. are examples of hydrocarbon diradicals A,ky<ene - alke " 

[0072] The term "hydrocarbyl" refers to any stable arrangement consisting entirely of carbon and hvdronen h™ 
a s-ng e valence srte to which it is bonded to another moiety, and thus includes radicals knownas afovl ^aikawTaMcln? 
cyctoaH* L cycloalkenyl. ary. (without heteroatom incorporation into the aryt ring). aSwXES ImSSL. tSt 
carbon radical is another name for hydrocarbyl. y wyl ana me ,lke - Hydro- 

' J 00 *? 1 , The term ■hyaYocarbylene- refers to any stable arrangement consisting entirely of carbon and hvdronon 
ing two valence sites to which it is bonded to other moieties, and thus includes al^ene alk^^n^Z^" 
cycloalkylene. cycloalkenylene. arylene (without heteroatom incorporation into 5e Z^L^Tt^'J^^ 
larylene and the like. Hydrocarbon diradical is another name for hydTocaroylene. 9) ' ary,a,kylene - a,k r 

[0074] The term "hydrocarbyl-O-hydrocarbylene" refers to a hydrocarbyl group bonded to an o™«, *t nm ^„ 

[0075] The term N-(hydrocarbyl)hydrocarbylene refers to a hydrocarbyl ene orouo wherein nn.ni* a w . 
sites is bonded to a nitrogen atom, and that nitrogen atom is JESS* ^^i^S^VTml^Z 
group The term N.N-di(hydrocarby.)hydrocarbylene refers to a hydrocarbyl^^oli^e^^T^r^^^ 

[0076] The term "hydrocarbyfacyl-hydrocarbylene- refers to a hydrocartoyl group b^Tc^^STc^ » 
group to one of the two valence sites of a hydrocarbyl ene group. mrougn an acyl (-C(=0)-) 

[0077] The terms "heterocyclylhydrocarbyr and "heterocylyr refer to a stable cyclic arranoemenf nt at h ^ 

UP 40 ^ at0mS < referred to 35 neteroatoms) seiecteo" £ c^^ IKES 
and sulfur. The cyd.c arrangement may be in the form of a monocyclic ring of 3-7 atoms. orHlicSortSS KTaEmf 
The nngs may be saturated or unsaturated (including aromatic rings), and may optiona ly be tara*^ wtll T!i 
sulfur atoms in the ring may be in any oxidized form, including thf quaternTi form^!^^ 
carbyi may be attached at any endocychc carbon or heteroatom which results in the creationo?* ^ V *Z 

n 3 ™ 6 ;^^ 8 indude s7 membered "-»«** ^t?^ 

[00781 A substituted heterocyclylhydrocarbyl refers to a heterocyclylhydrocarbyl as defined above who™* i M «* 

one ring atom thereof is bonded to an indicated substituent which extends off ofthe ring 

[0079] In refernng to hydrocarbyl and hydrocarbylene groups, the term "derivatives nf a™, «r • 

a=e^^ 

[0080] The term "activated ester" is an ester that contains a "leaving group" which is readily displaceable by a 
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T??*™*' Sl f 1 h ^ an amin9< analcoho < or a ""cleophile. Such leaving groups are well known and include with-. 

.^J^T ^r^ 1 ^" 71 ' N - hydroxyb8n20lria2o(e - n£ "°9en (Glides), alkoxy including tetrafluoropheno- 
lates. thioalkoxy and the l,ke. The term "protected ester" refers to an ester group that is masked or otherwise m££Z 
See. e.g.. Greene. "Protecting Groups In Organic Synthesis." unreacove. 

|_ ^ ln view °| , tne above definitions, other chemical terms used throughout this application can be easily under- 
stood by those ofskiinthe art Terms may be used atone or in any combination thereof The prefeTeWrle^ 
ferred chain lengths of the radicals apply to all such combinations. P 

A. GENERATION OF TAGGE D NUCLEIC ACID FRAfiMFMTR 

i^ 0821 -. "! 1ed a ^ Ve ' ° n8 3SpeCt 01 1116 P resent inven «on ProvWes a general scheme tor DNA sequendng which 
altows the use of more than 1 6 tags in each lane; with continuous detection, the tags can be detected andT^u«£ 
read as the size separation is occurring, just as with conventional fluorescence^based seque^nTrhte sSSrTS 
appticabte to any of theDNA sequencing techniques based on size separation of tagged rnSeSleTsu^etja^ 
deSTbe^T PfeSent ,nV8nti0n - 35 We " 35 metnods sequencing nucleic adds, are cfisc^Tn rnSe 

1. 



20 [0083] -Tag-, as used herein, generally refers to a chemical moiety which is used to uniquely identify a "molecule 
of interest" ard more specifically refers to the tag variable component as well as whatever may be bonded a^cSZ 
to it in any of the tag reactor*, tag component and tag moiety. 51 ciosery 

[0084] A tag which is useful in the present invention possesses several attributes: 

25 V "J 5 cap t bie ° f bein 9 distinguished from all other tags. This discrimination from other chemical moieties can be 
based on the chromatographic behavior of the tag (particularly after the deavage reaction) te sp^S^Tc « 
Potentiometoc properties, or some combination thereof. Spectroscopic methods by SS^J!SSSSSS. 
guishec I include imass spectroscopy (MS), infrared (IR). ultraviolet (UV). and fluorescence, where mT^WuV 

- or^te^d Pref6rred SPeCtroSC ° PiC methodS " Potentiometric amperometry is a preferred 

2) The tag is capable of being detected when present at 10 22 to 10 s mole 

3 !,i h9 , ta .9 p ° s f e ^f s a che^cal handle through which it can be attached to the MOI which the tag is intended to 
uniquely identify. The attachment may be made directly to the MOI. or indirectly through a linker" or.™ 

4) The tag i « chemically stable toward all manipulations to which it is subjected, induding attachment and deavaae 
35 from the MOI, and any manipulations of the MOI while the tag is attached to it «acnmeni ana aeavage 

5) The tag does not significantly interfere with the manipulations performed on the MOI while the tag is attached to 
it For instance, rf the tag is attached to an oligonucleotide, the tag must not significantly interfere iSJ ^hybrid? 
zation or eraymatic reactions (e.g.. PGR sequendng reactions) performed on the oligonucSowJ StoV» toe 
tag is attached to an antibody, it must not significantly interfere with antigen recognitior, by toVamfcody! 

[0085] A tag moiety which is intended to be detected by a certain spectroscopic or potentiometric method should 
possess properties which enhance the sensitivity and specifcrty of detection by that method. TypicaHy w£ t££ 
w,ll have those properties because they have been designed into the tag variable component wh ch wnltvo callTc^^ 
stituto the major portion of the tag moiety. In the to.lowing discussion, the use of the ^^ W^rl^T^Z 
™ e * e - «ne cleavage produci t that contains the tag variable component,, however can IXSSES refer 
to the tog variable component .tself because that is the portion of the tag moiety which is typically responsible for oro- 
vding the umquely detectable properties. In compounds of the formula T-L-X. the "T" portionwiH^r^^ 
component Where the tag variable component has been designed to be characterized by e T maZ ^o^w^! 
T portion of T-L-X may be referred to as T™. Likewise, the cleavage |^ST«5SStlJ 



a. Characteristics of MS Tags 



ss 



f 0086 ^ Where a teg is ana,yzab,e bv 01355 spectrometry [i.e.. is a MS-readabletag. also referred to herein as a M«? 

ment in the design of MS-readaWe tags to incorporate therein a chemical functionality which can carry a oosSorW 
stive charge under conditions of ionization in the MS. This feature confers improved efficiency of ton^ma^n and 



13 



1 



EP 0 992 511 A1 



°r a " Sens ^ rty 0, f e,e ^ 1 - Particularly in electrospray ionization. The chemical functionality that suooorts an 

ESL'ST ™ y ^ B fr0m ^ W L ■* **- Factore «" ^ease the relative sensZ? ^SKS 
detected by mass spectrometry are discussed in. e.g.. Sunner. J., et al. Anal. Chem. 60A3^7(^I 

[0087] A prefened functionality to facilitate the carrying of a negative charge is an oroanic acid surf, a * nh^nm* 
hjdm* carboxy^ acid L phosphonate. phosphate, tetrazole. suifony, urea. pZo™Zw artt^aT 
[0088] Prefened funcftonalrty to facilitate the carrying of a positive crwge urxler ionization i« alfeha «r 

n JllTnLT:**- EXam £!f K° f funCtional ^ enhanced d-J3£S£SE ^25 

nary ammes (/.e amines that have four bonds, each to carbon atoms, see Aebersold U S Patent nTsSo VS£ 

oTZT"TJ' e - ""S" * at ?T? b ° ndS - ea * to »*■» atoms. whichlnSudes 'SSSiSSffS « 
P^fj" Pyndme. see Hess et al.. Anal. Biochem. 224:373. 1995; Bures et al.. Anal. Biochem 224^1 Hin 
dered tertiary armies are particularly prefened. Tertiary and quaternary amines may be aferf oVaS 4 
mo,ety must bear at least one ionizaWe species, but may pLess ™e ^e sSet^^S 

charge state is a single ionized species per tag. Accordingly^s preferred thaVeach T% ^lh5™ 2 , P I? !? 
ta^variabfe component) contain only a single hindered aTne « ^ "»«y (and each 

[0089J Su-table amine^ontaining radicals that may form part of the T™-containir,g moiety include the following: 



f-\^-0-{Cr-C^- N(C,- 



C|o>2 



(C|— C l0 ) 



| — (C|—C )0 )-i/ ^ 




y 



N-(C t — C I0 ); 



|-(C|-C, 0 )-N^N 



CNH-CC.-C.o)-^^ . 



C, 0 ) 



•CNH~(C 2 -C I0 )-N 
O ^ 




•CNH 
II 
O 



-(Cz—C.oh-NCC.-C.o^; 



•CN 
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/ \ 
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£2?n. ! !^ C f' 0n i ° fataQby mass spectrometry is preferably based upon rts molecular mass to charge ratio 
mfc). The prefened rnolecutar mass range of MS tags is from about 100 to 2.000 daltons. and preferably the 7™^Sv 
taming moiety has a mass of at least about 250 daltons. more preferably at least about 300 daltons. and still more oref- 
erably at least about 350 daltons. It is generally difficult for mass spectrometers to distinguish an^moieti™ hS, 
parent .ons below about 200-250 daltons (depending on the precise instrument), andtous pr^^S^S 
moieties of the invention have masses above that range. M -containing 

[0091 ] As explained above, the T^-containing moiety may contain atoms other than those present in the tao vari- 
able component, and indeed other than present in T m " itself Accordingly, the mass of 1™ itself mav be less than J™* 

250 daltons. Thus, the mass of "P™ mav 
range from 15 (ie. a methyl radical) to about 10.000 daltons. and preferably ranges from 100 to about 5 000 daltons 
and more preferably ranges from about 200 to about 1. 000 daltons aanons. 
[00921 It is relatively difficult to distinguish tags by mass spectrometry when those tags incorporate atoms that have 

s^^S^ abundance - pr9,efT9d T «•** Se^SL^Lsp^ 

scopic identification (T™ groups), contain carbon, at least one of hydrogen and fluoride, and optional atoms selected 
from oxygen, nitrogen, sulfur, phosphorus and iodine. While other atoms may be present in the T™ their presencecan 
renderanalyss of the mass spectral data somewhat more difficult Preferably, the T™ groups have only carbon niSo- 
gen and oxygen atoms, in addition to hydrogen and/or fluoride. oniy carocn. nitro- 

[00931 Huoride is an optional yet preferred atom to have in a T™ group. In comparison to hydrogen, fluoride is. of 

^J? PrSSenCe °' " UOride atoms ^ hydrogen atoms leads toT^ grcW higher 
^n JS^TlT T T ,Q rSaCh ^ ~ 3 m3Sfi * neater than 250 daltons. «|*hSS£2 £ 
explained above. In addition, the replacement of hydrogen with fluoride confers greater volatility on the T^-containi™ 
m„e*and greater volatility of toe analyte enhances sensory when mass spectLetry is 5K2 "esto^eS 

HfTi and^ ^^^ll^lf 1 ' 8 Within SC ° Pe * C i-5CoNo- 1 ooO b - 1 ooS,, 1 oP ( , 1 oH a F (J , 4 wherein the sum 
of a. p and S is suftaent to satisfy the otherwise ur«atisfied valencies of the C. N. O. S and P atoms The designation 

<- i-soo N 0 . iooUo. "w 5 * i oPo- 1 o H a F p l 8 means that 1™ contains at least one. and may contain any number from 1 to snn 
carbon atoms, in addition to optionally containing as many as 100 nitrogen atomsfV means^t^n^S noTcon^ 
tain any nitiogen atornsK and as many as 100 oxygen atoms, and as many as 10 sulfur atoms and as many aTlOohS- 

2? Symb °'l a - P and 8 *e number of hydrogen, fluoride and iodide atonS inT^ wh^re a^ 

two of these numbers may be zero, and where the sum of these numbers equals the total of the otoerwfea'u^TrL!^ 
uaianrioo rrf r. m o CanrfDM. n--* ^m. . ^ « o u 10 luuu m me oineTwise unsatisfied 



valencies of the C. N O. S and P atoms. Preferably. T"» has a molecular formula that falls within the scope ofTST 
mSefy " sum of a and P equals the number of hydrogen and fluoride atoms, respactivelypreseni in the 



respectively, present in the 



35 b. Characteristics of IR Tags 
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[00951 There are two primary forms of IR detection of organic chemical groups: Raman scatterino. IR and ahe «m. 
lion IR. Raman scattering IR spectra and absorption IR spectra are complerrLaW^S^ 
Raman excitation depends on bond po.arizabi.ity changes whereas IR absorptii^S^to^oSi e moS 
changes WeaklR absorption lines become strong Raman lines and vice versTwavenumber is SSJSSIS 

W '"T,,™? f" 3 ^TT regi0nS ** ,R te9S whicn have se P arate applications: near IR at 'iSSSSS en, 
, mid IR at 4000 to 600 cm \ far IR at 600 to 30 cm '. For toe uses described herein where a conpound Ms Z seTe 
as a tag to identify an MOI. probe or primer, the mid spectral regions would be preferred. For e^3etoe c^rtowl 
stietch ( 850 101750 i™ 1 ) would be measured for carboxylic acids, carboxylic esters and aLet^'Xa^ 
carbonates, carbamates and ketones. N-H bending (1750 to 160 cm ') would be used to identify aminj T ammonium 
ions, and amides At 1400 to 1250 cm"'. R-OH bending is detected as well as the C-N stretS ^n^^^T 
•Hunan patterns are detected at 900 to 690 cm"' (C-H bending. N-H bending for MN^tZ^t^nl ^- 
matic rings. dou«e and triple bonds, esters, acetals. ketals. ammonium sal* N-O co^S as olTnl 

J* * 7^ hydr ! 20n6S ' quinon98 ' cafbo ^' c acid *- amides, and lactams all possess vtoatto* m£ 

red correlation data (see Pretsch et al.. Spectra! Data for Structure Determination of Organ* CompourSs^ZZ 
Verlag New York. 1989)^refened compounds would include an aromatic nitrite which e^7^2^^B 

S ! 6 C K m9 "ST" 1 f 30 t0 2210 Cm 0lhar ^ ° f C °"W** are aromatic alkynes which haveTsto^g 
stretching vtorat.cn that gives rise to a sharp absorption band between 2140 and 2100 cm" 1 A third co^oun^CTi! 

the aromatic azides which exhibit an intense absorption band in toe 2160 to 2120 cm 1 region Thiocvanatesare?^^ 
sentative of compounds that have a strong absorption at 2275 to 2263 cm' 1 yanates are repre " 
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c. Characteristics ofUV Tags 

[0096] A compilation of organic chromophobe types and their respective UV-visiWe properties is nh»n « o_ 
(Interpretation of the UV Spectra of Natural Products. Permagon PressNew Yo* ilS»TS™1 9 ^ 
or group of atoms or electrons that are r^n^ ^ Z^2^L T" York< 1 ^ . chromophore e an atom 



n* maxima in conjugated systems (see^re^ch"et"aT ^^/Xf^^^?? rU,8S ^ ^ w 10 
Compounds, p. 665 and 870. Sprinoer-VertaaT^oH; fSS%J^J? Determmation of Organic 

would possess^ to and , »3Z5£^ 

» trjzr G - 8rfl,iant 0,1,9 G - 00090 Red - <*^£^^ 

Methyl Orange. Methyl Violet B Naohtol Gr»n R na ri. .~ m n-. d~j ^ ^ 7 , * eiani Ye,KJW . Methylene Blue. 
3. and Sudan Oran^G. at, 

are listed in. e.g.. Jane. I., et al.. J. Chrom. 323.191-225 (1985) M.lwaukee. Wl). Other suitable compounds 

d. Characteristic of a Fluorescent Tag 

is 

[0097] Fluorescent probes are identified and quantitated most directlv hv aw , 

Slon lengths and intensities. Emission ^ec^LT^^^^^Z^^ Z^L^Z 

permit more specific measurements than absomfion ™,< ^1^1 TT ; much more sensitive and 

lifetime and fluorescence anisotropy iTl^ZT^rtf^Z photcp^ys.ca! characteristics such as excited-state 
» extinction coefficient (c) for aSo^a^X^mtie^ # generally useful ^slty parang are the molar 
g.e wavelength (usual* the absorb ^TnZo^ - * 

over the entire fluorescence spectral profile A narrow r^^L^.Zl , « a 4 measure 01 ,otal Photon emission 
J— I- absorb. wherSs »e ^e^enT^^ 

impXce is approxirLelylo ^ »"■*- 
orescenttags are as foflows: fluorescein, rbodamine. L£ bi?4% toEl£ £2 f^lf J!"- 

acr^ine orange, and proptdium iodde. which are commercially avaiiaSe^^ 

oo gS^STSS SUCh 38 ^ T6XaS Red ' lissa —~- BODIPV™ s a^SES MofeST 

e. Characteristics of Potentiometric Tags 

When certain compounds are subjected ToTdoS^h^^ ^ thus producmg a current which can be measured, 
the working elects" surfaced ^^^^^'ZZnT^^l * 
be electronic and undergo electrochemical reactions "b^^ 1^°^^ to 

the HPLC eluent flows Electroactive compounds elutinn frnm^T^h i * ^ 9 ® 3,1 e,ectrode over which 

- eiectrons (reduce, general a currem Z^Smp^^^ «•*• 
the concentration of the analyte and the voltaoe ar^liJ ^^l Zl^l ^l l generated depends on both 

begins to oxidize or reduce. The cunem£ ^ %Z ^^SSJ!SS- fh" 9 3 ***** « **** rt 

the potential is Kept constant and the cuLrt pH2 t^^^ Tc^^Z^T^ 

[0099] When the efficiency of detection is 100% the sneeiAii^oH Hai^ * 
detectors are sensitive which have a number ^£XSSSL T^JZ^T^ 
make these types of detectors useful in an array In coulome^deSrT selectn/rty ™* sensitivity which 
na, current is plotted as a function of the appSd P^t^^^Z^^Z^ 0 -- ** T 
o graph is called the current-voltage curve or hydrodynamic vortarnmagram SSjlSSS JSL StS TZ^Z 
applied potential to the working electrode that permits one to ,1 \T . . e best ^O'C© of 

[0100] Numerous chemicals and compounds are PlArtrrv+iami^iiw ^ . _,. 

ceuticals and pesticides ChromatographlS^e,^ 
f potentials (the potential at half signal maximum) differ by only 30-60 mV eneCl,Ve,y resolved even rf the,r half-wave 
[0101] Recently developed coulometric sensors provide selecHvitu Manfiis^.ti i ^ 

when * h ^ -«mJ55KS22J~lS S2^°SS5.'S 
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principle limited only by the rate at which data can be acquired. The number of compounds which can be resolved on 
the EC array is chromatographically limited (i.e.. plate count limited). However, if two or more compounds that chrorna- 
tographically co-elute have a difference in naff wave potentials of 30-60 mV. the array is able to distinguish the com- 
pounds. The ability of a compound to be electrochemically active relies on the possession of an EC active oxoud fi a. - 
s OH, -O, -N. -S). " v 

[0102] Compounds which have been successfully detected using coulometric detectors include 5-hydroxytryp- 
tamine. 3-methoxy-4-hydroxyphenyl-glycd. homogentisic acid, dopamine, metanephrine. 3-hydroxykynureninr ace- 
taminophen, 3-hydroxytryptophol. 5-hydroxyindoleacetic acid, octanesultonic acid, phenol, o-cresol, pyrogalld 2- 
nitrophenot. 4-rritrophenol. 2.4-dinitrophenol. 4,frdinitnxresol. 3-methyl-2-nrtrophend, 2.4-dichlorophenol. 2.6-dichlo- 
io rophend, 2.4,5-trichtorophenol, 4-chloro-3-methylphenol. 5-methylphenol. 4-methyl-2-nitrophenol. 2-hydroxyanifine 4- 
hydroxy aniline, 1 ,2-phenylenediamine. benzocatechin. buturan. chlortholuron. diuron. isoproturon. linuron. methobro- 
muron. metoxuron. monolinuron. monuron. methionine, tryptophan, tyrosine. 4-aminobenzoic acid. 4-hydroxybenzoic 
acid. 4-hydroxycoumaric add, 7-methoxycoumarin. apigenin baicalein. caffeic add. catechin. centaurdn. chtorogenic 
acid, daidzein. datiscetin. diosmetln. epicatechin galiate. epigallo catechin. epigaJlo catechin gailate. augend eupa- 
is tonn. fenJic acid, fisetin. galangin. gallic add. gardenia genistein. gentisic acid, hesperkfin. irigenin. kaemferd. ieucoy- 
anidin. luteolin. mangostin. morin. myricetin. naringin. narirutin. pdargondin. peontdin. phloretin. pratensdn 
protocatechuic acid, rhamnetin. quercetin. sakuranetin. scutellarein, scopoletin. syringaldehyde. syringic add. tangeri- 
tin. troxerutin. umbelliferone. vanillic acid. 1.3-dimethyf tetrahydroisoquindine. 6-hydroxydopamine. r-sateolind N- 
methyl-f-saJsolind, tetrahydrdsoquindine. amitriptyline. apomorphine. capsaicin. cHordiazepoxide. cWorpromazine 
20 daunorubidn. despramne. doxepin. fluoxetine, flurazepam. imipramine. isoproterend. methoxamine. morphine mor- 
phine-3-glucuronde. nortriptyline, oxazepam, phenylephrine, trimipramine. ascorbic acid. N-acetyt serotonin 3 4-dihy- 
droxybenzylamine. 3.4-cfihydr(wymanddic add (DOM A), 3.4-dihydroxyphenylacetic acid (DOPAC) 3 4- 
dihydroxyphenylalanine (L-OOPA). 3.4-dihydroxyphenylglycol (DHPG), 3-hydroxyanthranilic add. 2-hydroxyphenylace- 
tic acid (2HPAC). 4-hydroxybenzoic add (4HBAC). 5-hydroxyindde-3-acetic acid (5HIAA). 3-hydroxykynurenine 3- 
25 hydroxymandelic add. 3-hydroxy-4-methoxyphenylethylamine. 4-hydroxyphenylacetic acid (4HPAC) 4-hydroxyphenyl- 
latf ic acid (4HPLA). 5-hydroxytryptophan (5HTP). 5-hydraxytryptophd (5HTOL). 5-hydroxylryptamine (5HT) 5-hydrox- 
ytryptamine sulfate. 3-methoxy-4-hydroxyphenylgfycd (MHPG). 5-methoxytryptamine, 5-methoxytryptophan 5- 
methoxytryptophd. 3-methoxytyramine (3MT), 3-methoxytyrosine (3-OM-DOPA). 5-methy (cysteine. 3-methylguanine 
bufotenm. dopamine dopamine-3-glucuronide. dopamine-3-sulfate. dopamine-4-sulfate. epinephrine, epinine fbBo 
so acid, glutathione (reduced), guanine, guanosine. homogentisic add (HGA). homovaniBic add (HVA). homovanilM alco- 
hol (HVOL). homcveratic acid, hva sdfate. hypoxanthine. indole, indde-3-acetic acid, indole-3-lactic acid, kynurenine 
mdatonin. metanephnne. N-methyltryptamine. N-methyHyramine. N.N-dimethyttryptamine. N.N-dimethy1ty^amine , 
norepinephrine, normetanephrine. odopamine. pyridoxal. pyridoxal phosphate, pyridoxantine. synephrine. tryptophol 
tryptamine. tyramine. uric acid, vanillylmandelic acid (vma). xanthine and xanthosine. Other suitable compounds are set 
35 forth in. e.g,. Jane. I., et al. J. Chrom. 323:191-225 (1985) and Musch. Q. et ai. J. Chrom. 348:97-110 (1985) These 
compounds can be incorporated into compounds of formula T-L-X by methods Known in the art For example, com- 
pounds having a carboxylic acid group may be readed with amine, hydroxyf. etc. to form amide, ester and other link- 
ages between T and l_ 

[01 03] In addition to the above properties, and regardless of the intended detection method, it is prefened that the 
40 tag have a modular chemical strudure. This aids in the construdion of large numbers of structurally related tags using 
the techniques at combinatorial chemistry. For example, the T™ group desirady has several properties It desirably 
contains a functional group which supports a single ionized charge state when the T^-containing moiety is subjected 
to mass spectrometry (more simply referred to as a "mass spec sensitivity enhancer" group, or MSSE) Also it desirably 
can serve as one member in a family of T n, *-containing moieties, where members of the family each have a different 
45 mass/charge ratio, however have approximately the same sensitivity in the mass spectrometer. Thus, the members of 
the family desirady have the same MSSE. In order to allow the creation of families of compounds, it has been found 
convenient to generate tag reactants via a moddar syntheds scheme, so that the tag components themselves may be 
viewed as comprising modules. 

[0104] In a preferred modular approach to the structure of the T ms group. T" 8 has the formula 



T 2 <J-T 3 \- 



55 



wherein T 2 is an organic moiety formed from carbon and one or more of hydrogen, fluoride, iodide oxygen nitrogen 
sulfur and phosphorus, having a mass range of 15 to 500 daltons; T 3 is an organic mdety formed from carbon and one 
or more of hydrogen, fluoride, iodide, oxygen, nitrogen, sulfur and phosphorus, having a mass range of 50 to 1000 dal- 
tons; J is a direct bond or a functional group such as amide, ester, amine, sulfide, ether, thioester. disulfide thioether 
urea, thiourea, carbamate, thiocarbamate. Schiff base, reduced Schiff base, irrine. oxime. hydrazone. phosphate' 
phosphonate. phosphoramide. phosphonamide. sulfonate, sulfonamide or carbon-carbon bond; and n is an integer 
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HE? iT °V 10 * SUCh ** " iS 9reater ^ 1. each T3 and J is independently selected 
[0105] The modular structure T 2 -(J-T 3 )„- nrovides a mm/onionf #« ^~;r it. ^ 

adjuster group(-WRA"). A WRA is ^JLun^J^^^S^2^ ** * ^ ^ 

-a extending over a limited range. A single set Z"££l nCifuS toc-l^I. 9 IT hw » masses 

simply by incorporating one or more WRA gro^o^e^^^^TI^ 8 ^ 3 mass 
afton* a mass range of 250-340 daltons tor me aStSon ^rsiZwRHll^ ^ " * ^ 01 ^ 

1 00 oalton. as a T3 group provides access to the mas; rang^^^So^ e ^^^^^ numb * 
Similarly, the addition of two 1 00 dalton MWA grouos (eadi S ^ the same set of T 2 groups. 

» 540 daltons. where this incremental addition of £f£ £L » beZS^ 10 ^ ^ 01 450 * 
range tor the T™ group, Preferred corrpound^ ZuTa ^^T^ l^^e'^^n^^^^ '^T^ 
L-X where VWCisa T group, and each of the WRA and MSSE groun i am^^T^ ^1^^' Rmsse " 
in Rgure 13. and represents one modular approach to C^tfcTof^ * "** ^ ' 8 ,,,UStrated 

> Zahydro^sC^ 
carbyOh^^ 

wherein the heteroatom(s) are SSXJSS^^ heterocydylhydrocarbyl 
carbyl wherein the heteroatomfs) are «le^f «vC^? 1 u. phospnorus - s ^^ed heterocyclylhydro- 

se.ected from hydro<*rt£^ P^horus and the substrtuents are 

ylene. N-(hydrocartoyl)hydrocarbylene N N^^^^^aabvf^ivdm^^^i^ ~^^!^ C ^^^ en9 ' ^ t ' roca '* , y''^* n l' c ' roc arb- 
ad^ar*^ h 
hydrogens are replaced fluorides. ^ 7 1 sroups ' that one or more 

2L!fL • ^ fe9 f ""i" 9 *!! formUla T2 - (J - j3 ->n-. a preferred T* has the formula -G(R*)- wherein G is C allcrf*™ 
chain having a single R 2 substrtuent Thus if G is sthvi*™ _ru \ ^TJ ' efe,n u ,s c i-s alkytene 

arvfnma ciihc«i«.rf^ . . . 7 "•"""■> ur amynyiamino. arytoxy. arylarrnno. N-alkyturea-substituted alkvl N- 

aryiurea-substrtuted alkyl, alkylcarbonytamino-substituted alkvl aminrw-artw.wi.ci ,hc«, ' «_.. J™ 0 " 1 """ dl *y. n- 
cyclyt-substrtuted alkyl. heterocydyl-substtuted amirTcarbSv^I alk* heterocyclyl. hetero- 
heterocyc.yla.kyl; cydoalken^arJ-subsSS l^ aS^^^oTsS^^ T^T"^'^ "* 
aralkoxy-substrtuted alkyl. aJkoxy-substituted alkvl Li^w^^.S M ' alkoxy-substituted alkyl. 

alMoxy^rt»nylamino^ubSed^ amtno-substftuted alkyl. (aryl-substituted 

alkytt hio,-subs«uted a/kyl. »*al^^ 

""rphoTocarbor^-suc^^ carbonyl-substituted alkyl. thio- 

aminolcarbony.-substitu.ed alkyl. SeVocyS^^ cfalkenyl^ialkyny. or (alkyl. alkenyl)- 

arbonyt-substituted a.kyl. hic^SSS^ 
N.N^ia^nalMenean^ocarbor^^subst^ed alkyl. atky, 

ZfJ^Z ^- Ca,bOXy, - SUbStitUted ^.kylaminb-substftuted acy^noatk^d Si 
selected from argmine, asparag ne. alutamine. S-m<*hvi f*,**~t~ A kU . ^ ' amino aaa side chains 

derivatives thereof. glydneJeu^ne. Sc^e COrres P° ndin 9 and sulfone 

tophan. proline, a.an^e. ornithine. h^SJ^ ^ tryP ' 

allothreonine; afynyl and heterocyclylcarbonvl. aminocarbon^ ^ o. r^oTL rS^o^ be ^- c y anoa,an '^. and 
ylaminocarbonyl. alkylarylaminocarbonyl diLylamin^^vi ^i^T , d,alk y ,am ' noc arbonyl. mono- or diar- 
acv. a.ky, optionaHy suited by *±J2S?X 

[01 091 A preferred compound of the formula T 2 -(J-T 3 -) n -L-X has the structure: 
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T 2 



10 



Amide 

I 

O (CH 2 ) C 



wherein iQ is (CH^ such that a hydrogen on one and only one of the CH, groups represented bv a sind- -rs- « 
,5 replaced w,th-<CH 2 ) c -Aniide-r»; T 2 and T 4 are organic moieties of the formulae; J£,cT Q H i^TJL Z 

a and p is sufficient to satisfy the otherwise unsatisfied valencies of the 0 ^0^^ °' 

O O 
II II 

a> — N-C — or — C-N— ; 

1 1 1 1 

R 1 R 1 



25 



R 1 is hydrogen or C„ 0 alkyl; c is an integer ranging from 0 to 4; and n is an integer ranging from 1 to 50 such that when 
n is greater than 1. G. c, Amide, R 1 and T* are independently selected. 8,1 
[01 1 0] In a further preferred embodiment, a compound of the formula T 2 -(J-T 3 -) n -L-X has the structure: 



30 



T 4 
I 

An 



35 



I 

O (CHj), R i Q 

r' ° (CH 2 ) C 

Amide 

^l^i 8 " ° r ^ mo j ety « "]! formu,a C,- aS N 0 . 9 O 0 . 9 H a F (l such that the sum of a and p is sufficient to satisfy 
the otherwise unsatisfied valencies of the C. N, and O atoms; and T 5 includes a tertiary or quaternary amine ^rS 
organic aad; m is an integer ranging from 0-49. and T 2 , T 4 . R 1 . Land X have been previously deSS" 
[01 1 1 ] Another preferred compound having the formula T 2 -(J-T 3 -)„-L-X has the particular structure: 
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to 




13 



20 



wherein T 5 is an organic moiety of the formula C, , e M, „n up*,, 

the otherwise unsatisfied valencies Sic N S^S^S.^?- ^ * " 8nd P * to ^ 

organic acid; m is an integer mn&VZwZtf ZT^^fZVV*" °' » - 

[01 1 21 m the above structures tat have a TW> " *T, ' ^ " ^ PWi ° US,y d9,ined 

ientty made by reacting organic adds ^S^S^SSS^ *" '°" 0 * n9 ' ^ ^ 



NHC 
II 



25 




30 



O N 1? \^/-°-^- C .o)-N(C I --C, 0 ) 2 

(Ct-C I0 ) 

NH£-(C,-C, 0 )-/^ ; ~NHC-(C 0 -C, 0 )-r P S . 




NH ?-<N-(C,~C 10 ); and - NHC_( C) -C, 0 )~N 

O ^— ' H [ 

Sent ^"^r ssss ^j^^sar- a free r*** ^ «* — - 

- the approve organic amine with aVee Z^^^^T™^™"™' * * 
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O 




CNH-(C,-C 10 )-^ V — CNH-(C,-C 



II 
O 




o 




CNH-(C,_C 10h _( V CNH — (Cj—^ l0 ) — \i 




II 

o 




<Cr-C l0 ) 

-Chm-(C 2 -C I0 )-N^) ; 




| NH -<C 2 -C l0 )-.N<C,-C l0 ) 2 ; — ^H-(C 2 -C lfl >-/~] 




50 [01 14] In three preferred embodiments of the invention. T-L-MOI has the structure: 
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C, 0 ) — ODN — 3- — OH 
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or the structure: 
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O 

to H 



13 




C|o) — ODN — 3I—0H 



or the structure: 

20 



O 



25 1 G' j 

<CH2) C R 

Amide , 
1 O (C, C|q) — ODN — 3- — OH 



JO 



35 



wherein T 2 and T 4 are organic moieties of the formula C« o*N rt «r>~ p u c i ^ 

sufficient to satisfy the otherwise unsatisfied valencfi - tScwTosSSftl^ HS?,""^! * * ^ 8 * 
only one hydrogen on the CH, groups represented by each g7s replac^^^ ™ 



O O 

II II 

—N-C — or — C-N — ; 
40 • * I , 



45 



SO 



fro-T^ a hydrocan.ylene group having 

a nucleic acid fragment joined ,0 (C,-C 1o) ShT^^^ ■ 
ing from 1 to 50 such that when n is greater than 1 then Q c A™1 «ul ^4 fragment); and n is an integer rang- 
there are no, three heteroatoms bonded to a^e ^ atonTwherein^? ZT^^ 
C 1 . 25 N 0 . 9 O 0 . 9 H a F p such that the sum of a and a is M^^L ZlJT^ ° ^ m0,etles 01 fo ™la 

and O atoms; G isWs wherein one an^y on ^S^^S^ST^^^*^ °' C " * 
wfth-fCH^-Amide-T 4 : Amide is nyorogen on the CH 2 groups represented by each G is replaced 



O O 
II II 
— N-C — or — C-N—; 

55 I I 

R. R 1 
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R« is hydrogen or C,. 10 aJkyf; c is an integer ranging from 0 to 4; "ODN-3--OH" represents a nudeic acid fragment hav- 
ing a terminal 3 hydroxy! group; and n is an integer ranging from 1 to 50 such that when n is greater than t G c Amide 
R ' and T 4 are independently selected. ' fvnaB - 

i^2„» JllSTT ulS^J*™ 6 1,131 C ° ntain a T'-Ct'O-NfR 1 )- group, this group may be formed by reacting 
an amine of the formula HN(R»). with an organic acid selected from the following, which are exemplary only and do not 

tc^XTo T£ !? ? f ^ 0r9aniC addS: FormiC add - ^ **** add - P^K* acid. Ruoro- 
acetic acid, 2-Butynoic acid, Cydopropanecarboxylic acid. Butyric acid. Methaxyacetic acid Difluoroacetic acid 4-Pen 

tyno« acid Cyclo^ecarboxylic acid. 3.3-Dimethylacrylic acid. Valerie acid. N.N-Dimettylglycine. N-Fbrmyl-Gly-OH 
Ethoryacetic aad (Methytthio,acetic acid. Pyrro.e-2-carboxy.ic acid. 3-Furoic add. SSS^^^Z^ 
> 3-Hexeno« aod. Trrffooroacetc acid. Hexanoic acid. Ac^iy-OH. 2-Hydroxy-2-methybutyric acid. Zc acid Ntet 
imc aad. a-Pyrazjnecartwxyltt; acid. l-Methyl-2-pyndecarboxylic acid. 2-Cydopentene-1 -acetic acid. Cyctopentytact 
^ a ^J? ) ( "2" P ^°^ 5K:art,OXyliC add - N Methyl-L-pro.ine. Heptanoic acid. Ac-b-AJa-OH 

^aT^D?^ ,>T0,UiC a ° d - acid. S-Metn^^necaZcy- 

hc aod. 2,5-Dimethy1pyrrole-3-cart>oxylic aod. 4-Ruorobenzoic acid. 3,5-Dimethylisoxazole-t-carbcDtylic add 3- 
Cyclopentyjprop.onic add. Ocfanoic add. N.N-Dimethylsuccinamic acid. Phenylpropidic add. Cinnan^acid 4-Emtf - 
benzoic aod p-Anisic acid. 1.2.5-Trime1hylpyrrde-3^rboxylic acid. 3-Ruoro-t-methylbenzoic acid. Ac-OL-ProDar- 

S2ST!J 1 9 r ^ iyb lTi ''^^ epr0pi ° nlC add ' N^W"** 3.5-Drffoorobenzoic acid. 2cX 
VaNOH rxto^-ca/boxylic aod. 2-Benzofurancaiboxylic add. Benzofriazde-5-cart>oxylic acid. 4-n-Propyfoeruo.c 

L^^^ 4-Etnox^enxoic acid. 4KMethy«h to )ben 2 o fc ^^ 2 -Furoy^glydne!?S^ 
^nicotinic aod. 3-Ruoro-4-methoxybenzoic acid. Tfa-GI y -OH. 2-Napthoic acid. Quinaldic acid IAd-L-ll e-OH3- 
Methylindene-2-carboxylic acid. 2-Quinoxalinecarboxyic acid. 1-Methylindole-2-cartx>xyfic acid 2 3 

fr« ZLtl -D'methoxynK* tmc aa d. Cyclohexanepentanoic acid. 2-Naphthylacetic acid. 4K1H-Pynd-1-yl)benzdc 
SlSfS ?? £ uoromethylbenzoic acid. S-Melhoxyindo.e-2-cartDoxyfic add. 4-Peifbenzoic add 
Bz-b-Ala-OH. 4-O.ethylammobenzoc acid. 4-n-Butoxybenzoic acid. 3-Methyl-5-CF3-isoxazde^-cariic add (at 
D,me1hoxyphenyl)acetic acid. 4-Biphenytaarboxylic acid. Pivaloyl-Pro-OH. Octenoyf-Qy-OH (2^hSTacSc a<£ 

e%0Pheny,aCefc 3dd ' ^ethcxyindo.e-3-aSc add. J#E!!Z!X2 

bSSc 2 N PnTvSl^S Et^d^S-cart^te acid. N-<2-Ruorophenyl)sucdnamic acid. 3.4.5-Trimethox;- 
a^Ts^J^^^^^ 3Cid ' Nona "oy*-Q'y-OH. 2-Phenoxypyndir^3-can>o^ 

aod. 2.5-D ime th y l-^pj 1 eny^yrrol e -3K^rtx ) xyl l c acid. trans-4-<Trrfluoromethyl)cinnamic acid. (5-Metnyl-2-phenyloxJol- 
4-yl)aceta aod 4-<2-Cydohexeny.oxy)benzoic acid. 5-Methoxy-2-methylrndole^-acetic add tranSSnSS- 

L a «^^IC^ eW ' 4 ^ exyl °^ t38nz0ic aci * N-O-MethoxyphenyDsucdnamic acid. Z-Sv^R^. 
Dimethoxyphenyljbjtyric aod. Ac-o-Ruoro-DL-Phe-OH. N-(4-Ruorophenyl)glutaramic acid. 4^Ethyl-4*iph8nylcaroox- 
y ac ^i ^^a^Tetraj^oacrdinecari^xylic acid. 3-Phenoxyphenylacetic acid N-(2.4-Diffoo^h3SSS 
acid. N-Oecanoyl-Gly-OH. W^-Methoxy-a-melhy.-2-naphthaleneacetic add. 3-<Trifluorome^^ 
Formyl-DL-Trp^OH. (R)-( + )-a-Meth<»y-aKtrrffooromethyl)pheny l acetic acid. Bz-DL-Leu-OH. r^luwometfuxmtohe- 
noxyacetc aod. 4-Hepty.oxybenzoic add. 2.3.4-Trimelhoxydnnamic acid. 2.6-Din^thoxybenzoSy^ Ts-S?^ 
n^^rt^T^ !, cid - 2.3.4.5.6-Pentaf.uorophenoxyacetic acid. N-(2.4-Difluorophen^)dutaramic add M- 
Undecanoyl-Gly-OH, 2-(4-Ruorobenzoyl)benzdc acid. 5-Triffooromethc«yindole-2-carboxylic addT^^luoraohl 
nylWycolamic acid. Ac-L-Trp-OH. Tfa-L-Pheny.g.ycine-OH. 3-.cxJoben7oic acid. 3-(W^er^£^^ 
% ™Xf!2S1Sr^^ 4-Octy.oxybenzoic acid. Bz-L-Met-OH. ^S-k-KSSSSTSSS^SS 
Gly-OH 3 5-B.s(trrf tuor omethyDbenzoic add. Ac-5-Methyl-DL-Trp-OH. 2-lodophenylacetic acid. 3-TodS mettSben- 
zoic aad. 3^4-n-HexyfoenzoyOpropfonic acid. N-Hexanoy.-L-Phe-OH, 4-NonyToxybenzdc acid 4^fo^l1C-2- 
b-phenytearboxyUc aad. Bz-L-Phe-OH. N-Tridecanoy.-G.y-OH. S.S-Bi^rffoorcxneinyiJphenylacetic acid 3^Sy1- 
benzoyOpropionic aod. N-HepytanoyLL-Phe^H. 4-Decyloxybenzoic acid. N-(a.a.a P ^^ 
Nrflurmc aad. 4-(2-H y droxyhexafIuoro.sopro Py l)benzoi C acid. N-Myristoyl-Gly-OH. 3-(4-n-Octyfoer«oyl)prooionS add' 
N-Octanoyl-L-Phe-OH. 4-Undecy.oxybenzoic acd. S^S^.S-Trimethoxyphenyipropioril-GlySI T 8 7o^rSnofo acd 
^H^f ?^[ y ' 4 - Dode ^ ,ox V be "^ -«**. M-Pa.m rt oy.-G.y<)H y r^-Steara?-a y ^ 
rl T "TS^T °°V* 01 **™ nCea ChemTech - ^ isvi »«- Bachem Bioscience Inc.. Tonice C T 
Sr ^ M^° Vab,OChem °° rP - San Di69 °- CA: Farchan Laboratories Inc.. Gainesville PL; Lancaster Svnthesfe 
W,ndham NH; and MayBrkige Chemical Company (c/o Ryan Scientific). Columbia. SC. The catatoS ^from thie cot 
pan.es use the abreviations which are used above to identify the acids caia,ogs from these com- 

f. Combinatorial Chemistry as a Means for Preparing Tags 

[01 1 6] Combinatorial chemistry is a type of synthetic strategy which leads to the production of large chemical librar- 
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\ZlZT^l^% PCT < W™*™™^™ No- WO 94/08051). These combinatorial libraries can be used as 

?„T 0< '™l ecU,eS 01 in,erest < MOte >- Combinatorial chemistry may be defined^^ systmaS 
I^T^'h «™«*°n of a set of diffe '«« "buftinB blocks- of varyTng structures to other 

large array of diverse molecular entries. Building blocks can take many forms, bott, naturally occunrfnnT^I 
5 such as nuc.eophi.es. e.ectrophi,es. dienes. abating or acy.at.ng agel. 23 Trtt Z™ 

lipids, organic monomers, synthons. and combinations of the above. Chemical reactions^*! tn ™^ 2* k 

condensation, and the l.ke. Th.s process can produce libraries of compounds which are oliaornenc^.^S^ 

WWe sequences to be prepared and used. Such libraries can be divided into three broad cTeg^esTne cT^ 

of libraries involves the chemical synthesis of soluble non-sumort-hr...nH n • T ./^ une cale 9° f y 

**:84. 1991). A second category involves ^e^^£T^^^± 

blocks are typically L-am.no acids. D-amino acids, unnatural amino acidTor sWm^e^ ^ 
tWrd category uses molecular biology approaches to oreoare n^^ Z^Z^T T ° COmb ' nat0n t" 9 "** * 
parses or plasmas (Scott and cSSr. C^SSS S?SS 's^n^^ 
appear to be suitable for a number of applications, induding use as tags. The avS^e^e^*en^divS« 
SST ^ be 6KPanded * SUCh 38 P— (Ostresh et J^ES?£uK 



JO 



35 



40 



45 



50 



» • • 11 jo, 1 , - 

S2£3 * N T^ ar L antS ? PeP&ie combinatorial "buries are possible in which the peptide backbone is modified 
ur~f *f ^ S ST 6 , be6n rep,aced by mimetic S^oups. Amide mimetic groups wrtch m^^SuS 
ureas, urethanes. and carbonylmethylene groups. Restructuring the backbone such that, ttectate emlSff™ 
am.de nitrogens of each amino acid, rather than the alpha-carbons gives libraHeVoVcn^^^^Tl £• 
(Simon et ai.. Proc. Natl. Acad. ScL. USA 893367. 1 992) compounds known as peptoids 

[01 1 9] Another common type of oligomeric combinatorial library is the olioonucleotiria romw.-ws-i ^ 
lh.(x»k.^bk«*sa re5 o™ l brmo.na l u ra ., occur*, or un^r^oSS^^SSS^Si^^ 

hormones, drugs, a.kaloids. opioids, terpenes. porphyrins, toxins, catalysis, as well as co^nXi CeoT 9 ' 

g. Specific Methods for Combinatorial Synthesis of Tags 

[01 21 J Two methods for the preparation and use of a diverse set of amino U o * 

in both methods, solid phase synthesis is employed to ^SE2E£S£S a^^ 6 ^ 
tagged linkers, using the techniques of combinatorial chemistrv In JTvZirJlZZ J! ^ ge number 01 

from the oligonucleotide results* .iberatton of to toe ^ZESSZ"*" ****** ~ the 

a carhop ac*. The chemical componer^ ar* SSSS,^ SS^IZS^*" 



" = resin 

FMOC = f luorenylmethoxycarbonyl protecting group 

A" = alJyl protecting group 

55 C0 2 H = carboxylic acid group 

CONH 2 = carboxylic amide group 

NH 2 = amino group 

OH = hydroxy! group 
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CONH = amide linkage 

COO = ester linkage 

NH 2 - Rink - C0 2 H = 4-[( a -amino)-2 f 4Kiimethoxybenzyl]- phenoxybutyric acid (Rink linker) 

OH-1MeO-C0 2 H = (4-hydroxymethyl)phenoxytxjtyric acid 

OH - 2MeO - C0 2 H = (44iydroxymethyi-3^ethaxy)phenaxyacetjc acid 

NH 2 -A-COOH = amino acid with aliphatic or aromatic amine functionality in side chain 

XI . ..Xn-COOH = set of n cfiverse carboxylic acids with unique molecular weights 

oligol... oligo(n) = set of n oligonucleotides 

HBTU = 0^en2otriazol-1-yl-N,N.N\N'-tetramethyluronium hexafluorophosphate 

The sequence of steps in Method 1 is as follows: 

■ 

OH - 2MeO - CONH - R 

i FMOC - NH - Rink - COjH; couple (e g. , HBTU) 
FMOC - NH - Rink - COO - 2MeO - CONH - R 

i piperidine (remove FMOC) 
NH, - Rink - COO - 2MeO - CONH - R 
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i FMOC - NH - A - COOH; couple (e g. , HBTU) 
FMOC - NH - A - CONH - Rink - COO - 2MeO - CONH - R 

i piperidine (remove FMOC) 

NH, - A - CONH - Rink - COO - 2MeO - CONH - R 

■I divide into n aliquots 
44444 couple to n different acids XI .... Xn - COOH 

X1 Xn - CONH - A - CONH - Rink - COO- 2MeO - CONH - R 

44444 Cleave tagged linkers from resin with 1% TFA 

XI Xn - CONH - A -CONH - Rink - C0 2 H 

■ 

couple to n oligos (oligol oligo(n)) 

(e.g., via Pfp esters) 

X1 Xn - CONH - A - CONH - Rink - CONH - oligol oIigo(n) 

^ pool tagged oligos 

| perform sequencing reaction 

+ separate different length fragments from 

sequencing reaction (e.g. , via HPLC or 
* cleave tags from linkers with 25%- 1 00% TFA 

XI Xn - CONH - A - CONH 



analyze by mass spectrometry 



The sequence of steps in Method 2 is as follows: 



OH - 1 MeO - CO, - All 



FMOC - NH - A • C0 2 H; couple (e.g., HBTU) 



FMOC -NH - A - COO - IMeO • C0 2 - All 
i Palladium (remove Allyl) 
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FMOC - NH - A - COO - J MeO - CO z H 

5 i OH - 2MeO - CONH - R; couple (e.g. , HBTU) 

FMOC - NH - A - COO - lMcO - COO - 2McO - CONH - R 

t0 4- piperidine (remove FMOC) 

NH, - A - COO - IMeO - COO - 2MeO- CONH - R 

4 divide into n aJiquots 

' 5 444444 couple to n different acids X I Xn - CO,H 

XI Xn - CONH - A - COO - IMeO - COO - 2MeO - CONH - R 

so 44444 cleave tagged linkers from resin with 1 % TFA 

* 

XI Xn-CONH- A -COO- lMeO-CO,H 

I 

i 

25 i^44i couple to n oligos (oligol oIigo(n» 

{e.g., via Pfp esters) 

XI Xn - CONH - A - COO - IMeO - CONH - oligol oligo(n) 
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l pool tagged oligos 

^ perform sequencing reaction 

i separate different length fragments from 

sequencing reaction (e.g. , via HPLC or CE) 
+ cleave tags from linkers with 25- 1 00% TFA 



XI Xn - CONH - A - CO,H 



analyze by mass spectrometry 



2. Linkers 



[0122] A -linker" component (or L), as used herein, means either a direct covalent bond or an organic chemical 
group which is used to connect a "tag* (or J) to a "molecule of interest" (or MOI) through covalent chemical bonds. In 
addition, the direct bond itself, or one or more bonds within the linker component is cleavable under conditions which 
allows T to be released (in other words, cleaved) from the remainder of the T-L-X compound (including the MOI compo- 
nent). The tag variable component which is present within T should be stable to the cleavage conditions. Preferably the 
cleavage can be accomplished rapidly; within a few minutes and preferably within about 15 seconds or less. 
[0123] In general, a linker is used to connect each of a large set of tags to each of a similarly large set of MOIs 
Typically, a single tag-linker combination is attached to each MOI (to give various T-L-MOI). but in some cases more 
than one tag-linker combination may be attached to each individual MOI (to give various (T-L)n-MOI). In another embod- 
iment of the present invention, two or more tags are bonded to a single linker through multiple, independent sites on the 
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linker, and this multiple tag-linker combination is then bonded to an individual MOI (to give various C0n-L-MOI) 
[0124] After various manipulations of the set of tagged MOIs. special chemical and/or physical conditions are used 

bo^rJT ° r ™* C " a ' ent bon * the linker ' ^ the liberation of the tags iSSSSS 

bond(s) may or may not be some of the same bonds that were formed when the tag. linker and MOI were conS 

STa™* H deS ? n I °l " nke : " determine *• ""di whfon cfeavage may STcSS 

pushed. Accord,ngly ..nkers may be identified by the cleavage conditions they are particularly SSt<»Z^ 

ticufcrly susc^e to cleavage by add. base'chen^^ti^enS TTSS^S^ 
W^-.M electrochemica. oxidation or reduction elevated temperature fJSS- flS 

S deavTrcSo^t^^ ,!°w a "I* ST " C ° nMo "' whereas ^ are ««• *> several 

s^^^^S^rSL ^J^t^J» « ~ «o *• (Tjn-L-MO, type 



„ ^ ' ; ~" r: ~* w ltto " e ro amerem cleavage conditions. For example, in a linker having 

n^i .IT faSS ™ y 56 ,abj,e ° n,y to tese ' and the labil ° °nly to photolysis. 

10126] A linker which is useful in the present invention possesses several attributes: 

1) The linker possesses a chemical handle (L*) through which it can be attached to an MOI 
*!™ e , , P 055 ® 5 !!^ second ' separate chemical handle (L+,) through which the tag is attached to the linker If 
mumple tags are attached to a single linker ((T)n-L-MO. type structures), then a separate r^le^s^tortachtea 
3 The .inker • stable toward all manipulations to which it is subjected, with theTception of SlSSS^Sh 
aBow cleavage such that a T-containing moiety is released from the remainder of the ^np^^^u^s^ 
Thus, the linker « stable during attachment of the tag to the linker, attachment of theTker to the MO? ,^Vrt 
manipulations of the MOI while the tag and linker (T-L) are attached to it *"* 
25 4) The linker does not significantly interfere with the manipulations performed on the MOI while the TL « a » a oh~i 
to ,t For instance, if the T-L is attached to an oligonucleotide, the T-L must inferfe^ J£ 

^T* ^ PCR) performed « *• Cigonudeotida 12.^^^^^; 
antibody, it must not significantly interfere with antigen recognition by the antibody. f0 *" 

5) Cleavage of the tag from the remainder of the compound occurs in a highly controlled manner usina ohvsical or 
so chemical processes that do not adversely affect the detectability of the tag. manner, using physical or 

[01271 For any given linker, it is preferred that the linker be attachable to a wide varietv of MOte ann , ...^ 
variety of tags be attachable to the linker. Such flexibility is advantaged V^^SSXS^t SXS 
once prepared, to be used with several different sets of MOIs. conjugates. 
35 [0128] As explained above, a preferred linker has the formula 

U-L'-L^-Lh 

wherein each is a reactive handle tat can be used to link the linker to a tag reactant and a molecule of interest reac- 

1 *" ^ P3rt ? 2 th6 ,inker * bSCauS9 lm P arts «««* to the linker. L 1 and L 3 are S^g^oTwIS 
effectively serve to separate L 2 from the handles L^. opuonai groups wnich 

wf 7— i- (which ' by ^ ir f°* is "earer to T than is L 3 ), serves to separate T from the required labile moiety L a 
Th re separation maybe useful when the cleavage reaction generates particularly reactive spe^es!e afreeTaSLsl 
wh^hmay cause random changes in the structure of the T-containing moiety. As the deavaT^te is Sfc£l£22 
45 the ^-containing moiety, there is a reduced likelihood that reactive species formed at the >£ZZa^25££ 

he structure of the T-containing moiety. Also, as the atoms in Li will typically be present in STXnl^SE 
theseLi atoms may impart a desirable qualrty to the T-containing moie^For exanple. whence T^rfno 2 
,saT™-c 0 nta,n,ng moiety, and a hindered amine is desirably present as part of thesttucture toe V^ZSJ^S 
ety (to serve, e.g.. as a MSSE) the hindered amine may be present in LMabile moiety contaimng ma- 

50 V 3 °! ,M ^ri" 8 * 3 " 068 - Ll and/w L 3 may be present in a linker component merely because the commercial suo- 
plier of a linker chooses to sell the linker in a form having such a U and/or L 3 group. In such an in^Tthe^e is no 

Jrrjr nB ? ^« and/or L 9roups - (so ion9 35 mese sroup *» not «** ^ d^C^SS i£ 

hough they may not contribute any particular performance advantage to the compounds that incorporate Sm Z 
the present invention allows for U and/or L 3 groups to be present in the linker corrponent ,nCOrpCrate them - Thus " 
s [0131J L and/or L groups may be a direct bond (in which case the group is effectively not present) a hvdrocart* 
ylene group (e.g., alkylene. arylene. cydoalkylene. etc.). -O-hydrocarbylene (e a -O-CHo- Sch rSru r^T^ 

X^XT**^^' wherein w is an integerSng,ng ^^^(iS^S^S^ 
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[01 32J With the advent of solid phase synthesis, a great body of literature has developed regarding linkers tat are 
labile to specific reaction conditions. In typical solid phase synthesis, a solid support is bonded through a labile linker to 
a reactive site, and a molecule to be synthesized is generated at the reactive site. When the molecule has been com- 
pletely synthesized, the solid support-linker-molecule construct is subjected to cleavage conditions which releases the 
molecule from the solid support The labile linkers which have been developed for use in this context (or which may be 
used in this context) may also be readily used as the linker reactant in the present invention. 

[0133] Lloyd-Williams. P., et al.. -Convergent Solid-Phase Peptide Synthesis". Tetrahedron Report No. 347 
49(48):1 1 065-1 1 133 (1 993) provides an extensive discussion of linkers which are labile to actinic radiation (/ e phcn 
tolysis). as well as acid, base and other cleavage conditions. Additional sources of information about labile linkers are 
well known in the art 

[01 34] As described above, different linker designs will confer cleavability flabifity") under different specific physical 
or chemical conditions. Examples of conditions which serve to cleave various designs of linker include acid, base oxi- 
dation, reduction, fluoride, thiol exchange, photolysis, and enzymatic conditions. 

[0135] Examples of cleavable linkers that satisfy the general criteria for linkers listed above will be well known to 
those in the art and include those found in the catalog available from Pierce (RocMbid. IL). Examples include: 

• ethylene glycobis(succinimidylsuccinate) (EGS). an amine reactive cross-linking reagent which is cleavable by 
hydroxylamine (1 Mat 37°C for 3-6 hours); 

• disuccinimidyl tartarate (DST) and sulfo-DST. which are amine reactive cross-linking reagents, cleavable by 0 015 
M sodium periodate; 

• bis[2-(sucdnimkJyloxycarbonyloxy)ethyl]sulfone (BSOCOES) and sulfo-BSOCOES. which are amine reactive 
cross-linking reagents, cleavable by base (pH 11.6); 

• 1.4<li-[3^(2'-pyridyWithio(propionamido))butane (DPDPB). a pyridyldithioi crosslinker which is cleavable by thiol 
exchange or reduction; 

• N-[4-(p-azidosalicytamido)^ (APDP), a pyridyldithiol crosslinker which is 
cleavable by thiol exchange or reduction; 

• bis-[beta-4-(azidosalicylamido)ethyl]-disulfide. a photoreactive crosslinker which is cleavable by thiol exchange or 
reduction; 

• N-sucdnimidyH4-azidophenyl)-1 .3'dithiopropionate (SADP). a photoreactive crosslinker which is cleavable by thiol 
exchange or reduction; 

• sulfosucdrTimidyl-2-(7-azido-4-meth^ (SAED) a photoreactive 
crosslinker which is cleavable by thiol exchange or reduction; 

• sulfosucdriimidyl-2-(m-azid<><)^ (SAND), a photoreactive crosslinker 
which is cleavable by thiol exchange or reduction. 

[01 36] Other examples of cleavable linkers and the cleavage conditions that can be used to release tags are as fol- 
lows. A silyl linking group can be cleaved by fluoride or under acidic conditions. A 3-. 4-. 5-. or 6-substituted-2-nitroberv 
zyloxy or 2-. 3-. 5-. or 6-substituted-4-nitrobenzyfoxy linking group can be cleaved by a photon source (photolysis) A 3- 
. 4-, 5-. or 6-substrtuted-2-alkoxyphenoxy or 2-. 3-. 5-. or 6-substrtuted-4-aikoxyphenoxy linking group can be cleaved 
by Ce(NH 4 ) 2 (N03) s (oxidation). A NCC^ (urethane) linker can be cleaved by hydroxide (base), acid, or LiAIH 4 (reduc- 
tion). A 3-pentenyl, 2-butenyl. or 1-butenyl linking group can be cleaved by 0 3 , OjAi/ICV. or KMn6 4 (oxidation) A 2- 
[3-. 4-. or 5-substituted-furyf]oxy linking group can be cleaved by 0 2 . Br 2 . MeOH, or acid. 

[01 37] Conditions for the cleavage of other labile linking groups include: t-alkyloxy linking groups can be cleaved by 
acid; methyl(diaikyl)methoxy or 4-substituted-2-alkyl-1,3-dioxlane-2-yl linking groups can be cleaved by hUO* 2 
silylethoxy linking groups can be cleaved by fluoride or acid; 2-(X)-ethoxy (where X = keto. ester amide, cyano. NO* 
sulfide, sulfoxide, sulfone) linking groups can be deaved under alkaline conditions; 2-. 3-, 4-, 5-. or 6-substituted-bert 
zyloxy linking groups can be deaved by acid or under reductive conditions; 2-butenyloxy linking groups can be deaved 
by (Ph 3 P) 3 RhCI(H), 3-. 4-, 5-, or 6-substituted-2-bromophenoxy linking groups can be deaved by Li, Mg or BuLr meth- 
ylthiomethoxy linking groups can be deaved by Hg 2 *; 2-(X)-ethyloxy (where X » a halogen) linking groups ^an be 
cleaved by Zn or Mg; 2-hydroxyethy!oxy linking groups can be cleaved by oxidation (e.g.. with Pb(OAc) 4 ) 
[01 38] Preferred linkers are those that are cleaved by acid or photolysis. Several of the add-labile linkers that have 
been developed for solid phase peptide synthesis are useful for linking tags to MOIs. Some of these linkers are 
described in a recent review by Uoyd- Williams et al. ( Tetrahedron 49:1 1065-1 1 133. 1993). One useful type of linker is 
based upon p-aJkoxybenzyl alcohols, of which two. 4-hydroxymethylphenoxyacetic acid and 4-(4-hydroxymethyl-3- 
methoxyphenoxy)butyric acid, are commercially available from Advanced ChemTech (Louisville. KY). Both linkers can 
be attached to a tag via an ester linkage to the benzylaJcohol, and to an amine-containing MOI via an amide linkage to 
the cart»xyiic acid. Tags linked by these molecules are released from the MOI with varying concentrations of triffuoro- 
acetic acid. The cleavage of these linkers results in the liberation of a carboxylic add on the tag. Acid deavage of tags 
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attached through related linkers, such as 2.4^imethcocy^^( ra rboxym e thyloxy)4 3 enzr V drylarr»ne (available from 
Advanced ChemTech m FMOC-protected farm), results in liberation of a carboxylic amide on the released tag, 
[01 39] The photolabile linkers useful for this application have also been for the most part developed for solid phase 
peptide synthesis (see Lloyd- Williams review). These linkers are usually based on 2-nitrobenzylesters or 2-nitrobenzv- 
lamides. Two examples of photolabile linkers that have recently been reported in the literature are 4-(4-(1-Fmoc- 
arror»)ethyl)-2-methoxy-5-nitrophenoxy)butanoic acid (Holmes and Jones. J. Org. Chem. 602318-2319 1995) and 3- 
(Fnw-ainino)^-(2-nitropheny0propionic acid (Brown et al. Molecular Diversity 7:4-12. 1995) Both linkers can be 
attached via the carboxylic acid to an amine on the MOI. The attachment of the tag to the linker is made by forming an 
amide between a caiboxylic acid on the tag and the amine on the linker. Cleavage of photolabile linkers is usualty per- 
formed with UV light of 350 nm wavelength at intensities and times known to those in the art. Cleavage of the linkers 
resufts in liberation of a primary amide on the tag. Examples of photocleavabie linkers include nitrophenyt glycine 
ester* exo- and endo-2-benzonorbomeyl chlorides and methane sulfonates, and 3-amino-3(2-nitropheny0 propionic 
acid. Examples of enzymatic cleavage include esterases which will cleave ester bonds, nucleases which will cleave 
phosphodiester bonds, proteases which cleave peptide bonds, etc. 

[0140J A preferred linker component has an ortho-nitroberizyl structure as shown below: 




wherein one carbon atom 
is present to the left of N(R 



at positions a b. c. d or e is substituted with -L 3 -X. and L 1 (which is preferably a direct bond) 
< > ,n «"« above structure. Such a linker component is susceptible to selective photo- induced 
cleavage of the bond between the carbon labeled "a" and N(R<). The identity of R 1 is not typically critical to the cleavage 
reaction, however R rs preferably selected from hydrogen and hydrocarbyl. The present invention provides that inthe 
above structure. -IM(R')- could be replaced with -O-. Also in the above structure, one or more of positions b c. d or e 
may optionally be substituted with alkyl. alkoxy. fluoride, chloride, hydroxy!, carboxylate or amide, where these substit- 
uents are independently selected at each occurrence. 

[01 41 J A further preferred linker component with a chemical handle has the following structure- 




wherein one or more of portions b. c. d or e is substituted with hydrogen, alkyl. alkoxy. fluoride, chloride hydroxvl car- 
boxylate or amide, R' is hydrogen or hydrocarbyl. and R* is -OH or a group that either protects or activates a carboxylic 
acd for coupling with another moiety. Fluorocarbon and hydrof luorocarbon groups we preferred groups that activate a 
carboxylic acid toward coupling with another moiety. 



3. Molecule of Interest (MOI) 



0142] Examples of MOIs include nucleic acids or nucleic acid analogues (e.g.. PNA), fragments of nucleic acids 
(i.e.. nucleic acid fragments), synthetic nucleic acids or fragments, oligonucleotides (e.g.. DNA or RNA) proteins pep- 
tides, antibodies or antibody fragments, receptors, receptor ligands, members of a ligand pair, cytokines 'hormones oB- 
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gosaccharides, synthetic organic molecules, drugs, and combinations thereof. 

[01 43J Preferred MOIs include nucleic acid fragments. Preferred nucleic acid fragments are primer sequences that 
are complementary to sequences present in vectors, where the vectors are used for base sequencing. Preferably a 
nucleic acid fragment is attached directly or indirectly to a tag at other than the 3' end of the fragment; and most prefer- 
ably at the 5' end of the fragment. Nucleic acid fragments may be purchased or prepared based upon genetic data- 
bases (e.g.. Dib et aL. Nature 350:1 52-1 54. 1 996 and CEPH Genotype Database. http://Wvvw.cephb.fr) and commercial 
vendors (e.g.. Promega, Macfison, Wl). 

[0144J As used herein, MOI includes derivatives of an MOI that contain functionality useful in joining the MOI to a 
T-L-Lh compound. For example, a nucleic acid fragment that has a phosphodiester at the 5* end. where the phosphodi- 
ester is also bonded to an alkyleneamine. is an MOI. Such an MOI is described in. ag.. U.S. Patent 4.762.779 which is 
incorporated herein by reference. A nucleic acid fragment with an internal modification is also an MOI. An exemplary 
internal modification of a nucleic acid fragment is where the base (e.g.. adenine, guanine, cytosine. thymidine, uracil) 
has been modified to add a reactive functional group. Such internally modified nucleic acid fragments are commercially 
available from. e.g.. Glen Research, Herndon. VA. Another exemplary internal modification of a nucleic acid fragment 
is where an abasic phosphoramidate is used to synthesize a modified phosphodiester which is interposed between a 
sugar and phosphate group of a nucleic acid fragment The abasic phosphoramidate contains a reactive group which 
allows a nucleic acid fragment that contains this phosphoramidate-derived moiety to be joined to another moiety, e.g. a 
T-L-L^ compound Such abasic phosphoramidates are commercially available from. e g.. Clonetech Laboratories, Inc 
Palo Alto. CA. 

4. Chgmipql Hqnfles (L£ 

10145] A chemical handle is a stable yet reactive atomic arrangement present as part of a f irst molecule, where the 
handle can undergo chemical reaction with a complementary chemical handle present as part of a second molecule 
so as to form a covalent bond between the two molecules. For example, the chemical handle may be a hydroxy! group! 
and the complementary chemical handle may be a carboxylic acid group (or an activated derivative thereof, e.g a 
hydrofluroaryl ester), whereupon reaction between these two handles forms a covalent bond (specifically, an ester 
group) that joins the two molecules together. 

[0146] Chemical handles may be used in a large number of covalent bond-forming reactions that are suitable for 
attaching tags to linkers, and linkers to MOIs. Such reactions include alkylation (e.g.. to form ethers, thioethers). acyta- 
tion (e.g.. to form esters, amides, carbamates, ureas, thioureas), phosphorylation (e.g.. to form phosphates, phospho- 
nates. phosphoramides. phosphonamides). suHonylation (e.g.. to form sulfonates, sulfonamides), condensation (e.g., 
to form imines. oximes. hydrazones). silylation, disulfide formation, and generation of reactive intermediates, such as 
nitrenes or carbenes, by photolysis. In general, handles and bond-forming reactions which are suitable for attaching 
tags to linkers are also suitable for attaching linkers to MOIs, and vice-versa. In some cases, the MOI may undergo prior 
modification or derivitization to provide the handle needed for attaching the linker. 

[0147] One type of bond especially useful for attaching linkers to MOIs is the disulfide bond. Its formation requires 
the presence of a thiol group ("handle") on the linker, and another thiol group on the MOI. Mild oxidizing conditions then 
suffice to bond the two thiols together as a disulfide Disulfide formation can also be induced by using an excess of an 
appropriate disulfide exchange reagent, e.g.. pyridyl disulfides. Because disulfide formation is readily reversible the 
disulfide may also be used as the cleavable bond for liberating the tag, if desired. This is typically accomplished under 
similarly mild conditions, using an excess of an appropriate thiol exchange reagent, e.g.. dhhiothreitd. 
[0148] Of particular interest for linking tags (or tags with linkers) to oligonucleotides is the formation of amide bonds. 
Primary aliphatic amine handles can be readily introduced onto synthetic oligonucleotides with phosphoramidHes such 
as 6-monomethoxytritylhexylcyanoethyl-N,N-diisopropyl phosphorarnkJite (available from Glenn Research. Sterling, 
VA). The amines found on natural nucleotides such as adenosine and guanosine are virtually unreactive when com- 
pared to the introduced primary amine. This difference in reactivity forms the basis of the ability to selectively form 
amides and related bonding groups (e.g.. ureas, thioureas, sulfonamides) with the introduced primary amine, and not 
the nucleotide amines. 

[0149] As listed in the Molecular Probes catalog (Eugene, OR), a partial enumeration of amine-reactive functional 
groups includes activated carboxylic esters, isocyanates, isothiocyanates. sulfonyl halides. and dichlorotriazenes 
Active esters arc excellent reagents for amine modification since the amide products formed are very stable. Also these 
reagents have good reactivity with aliphatic amines and low reactivity with the nucleotide amines of oligonucleotides 
Examples of active esters include N-hydroxysuccinimide esters, pentafluorophenyl esters, tetrafluorophenyl esters and 
p-nitrophenyl esters. Active esters are useful because they can be made from virtually any molecule that contains a car- 
boxylic acid. Methods to make active esters are listed in Bodansky (Principles of Peptide Chemistry (2d ed ) Sprinqer 
Verlag, London, 1993). ' 
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5. Linker Attachment 

VS. . Typica " y> a single ,ype of finker is used to connect a particular set or family of tags to a particular set or famflv 

ST^TiS; l 6 S red er £° diment ° f I? 9 invention - a unifo ' m P roc *^ -ay be followed create all the var 
ous T-L-MOI structures. Tire .s espeaally advantageous when the set of T-L-MOI structures is large, because it altows 
the set to be prepared usmg the methods of combinatorial chemistry or other parallel processing technology Ina sirrtS 

tTTH T 01 a ^ n9 ' e ^ e0f i nker aUOWS 3 Sing,e - uniform ^ to ^ employed for c.eaving^ the ^ 
T-L-MOI structures. Again, this is advantageous for a large set of T-L-MOI structures, because the set mavbeomJ 
essed in a parallel, repetitive, and/or automated manner. because me set may be proc- 

' !° 1511 _ Jhere however, other embodiment of the present invention, wherein two or more types of linker are used 
to connect different subsets of tags to corresponding subsets of MOIs. in this case, selective cleavage conditions mav 
™«f 10 ?T aVe ' inkerS '"dependent!* without deaving the linkers present on other subsets* Sk 

231? o _ " number 01 covalent »»nd-forming reactions are suitable for attaching tags to tinkers and linkers to 
MO* Such reactions include Ration (e.g.. to form ethers, thioethers). acylatton (a?., tftorm ^^ide^L 

^T? 8, I 68 !" ^"; reaS) ' P ^° Sph0rylat,0n < ft »- to form phosphates, phosphorates, phosphoramides. phosrSW 
mries^ sutfonylaoon (e.g to torm sulfonates, sulfonamides), condensation (e.g. to form im^admes. hydSon^ 
s..yl a »on disulfide formation, and generation of reactive intermediates, such as nitrenes or carbenesX XS&tl 

fZZZ^^rS?"'*™*" 9 T*"* WhiCh 8re SUitaWe attachin9 *■>» to **« «• also suSSXctv 
ng linkera to MOIs. and vice- versa In some cases, the MOI may undergo prior modification or derivrtization to provide 
the handle needed for attaching the linker. provide 

[01 531 One type of bond especially useful for attaching linkers to MOIs is the disulfide bond. Its formation requires 
the presence of a thwl group ("handle", on the linker, and another thiol group on the MOI. Mild «Hbfc5SS£^ 

IT 9 f°? *Z!~ T' S t09ethef 38 9 d ' SU,,ide - DiSU,fide ,0rmation <*" a,SO ^ induced by udn^aTexc^s -SZ 
appropriate disulfide exchange reagent, e.g.. pyridyl disulfides. Because disulfide formation is readly revenue the 
disulfide may also be used as the CeavaWe bond for liberating the tag. if desired. This is typically accomplished undS 
similarly rn.Wcondrt.ons. using an excess of an appropriate thiol exchange reagent e.g.. ditoiothrertoT 

K J? partK * Jar for ,inKin9 ,a9S t0 oligonucleotides is the formation of amide bonds Primary aliphatic 

3 !TL h f S !'i^ y J ' ntrDdUCed 00,0 synthetic ^nucleotides wrth phosphoramdites such as 6-monome^ 
oxytrrrylhex7lcyanoeth y l -N, N-dnsopropyl phosphoramidite (available from Glenn Research. Sterting VA) TheTm.nl 
found on natural "udeotides such as adenosine and guanosine are virtually unreactive when ZpTed to lTZt 
duced pnmary arrnne. This drfference in reactivity forms the basis of the ability to selectively form amides and refated 
bond^g groups (e.g. ureas thioureas sulfonamides) with the introduced primary aminl andZTe^eSoe 

[ °l 5 *{ Bsted j n **. Mde ~ , » Probes catal °9 (^Sene. OR), a partial enumeration of amine-reactive functional 
groups ^eludes activated carboxylic esters, isocyanates isothiocyanates. sulfonyl halides. and dicWorotoaTe^es 
Active esters are excellent reagents for amine modification since the amide products formed ar; very sSfo^Kese 
reagerrts have good reacfivrty with aliphatic amines and low reactivity wrth the nucleotide amines ^TgonuditiaS 
Examples of active esters include N-hydroxysucdnimide esters, pentaf luoropheny. esters, tetraf luoiSSTJSSjS 
p-mtrophenyl esters. Acuve esters are useful because they can be made from virtually any molecule toa t conSSa" S 

S Lon^n^ 0 3CtiVe 6SterS MSted i0 BOdanSKy ° f ChGmiStry (26 *•>• 

10156] Numerous commercial cross-linking reagents exist which can serve as linkers (e.g.. see Pierce Cross-link- 
ers. P.erce Chemical Co. Rockford. IL). Among these are homobrtunctional amr^eactive cnSISS» reaTel 
TSl'Z exe 7 ,tfi ^ ( b y ."^^to^ona' imidoesters and N-hydroxysuccinimidyl (NHS) esters. There also exist het- 
erobrtunctiona. cross-hntang reagents possess two or more different reactive groups that allows for sequential I reac- 
t.ons. .masters read rap,dly with amines at alkaline pH. NHS-esters give stable products when reacted wrth primer 

* Z fr^^ ^ ary * ha,ideS ' ^-"atoaoyls and pyridyl disuses are thiol readme S 

m ld es are specrf.c for thiol (sutfhydryl) groups in the pH range of 6.5 to 7.5. and at alkaline pH can become aminVreac- 
ftve. The toioether linkage is stable under physiological conditions. Alpha-haloacetyl cross-linking reagents contain toe 
.odoacetyl group and are reactive towards suHhydryls. Imidazoles can read with the iodoacetyl moiety but the reaction^ 
is very slow. Pyridyl disulfides react with thiol groups to form a disulfide bond. Carbodiimides co^pte a2!Ste55 
mary amines of hydrazides which give rises to the formation of an acyl-hydrazine bond. The arylazides are photoaffinrty 
reagents which are chemically inert until exposed to UV or visible light. When such compounds are photdyzedTt 250^ 
460 nm. a reactive aryl nitrene is formed. The reactive aryl nitrene is relatively non-specific Glyoxals are reactive 
towards guanidinyl portion of arginine. ^ oiyoxais are reactive 

I 015 ^ ,- ? emb ° d i ment of the ^ent invention, a tag is first bonded to a linker, then the combination of 

tag and linker is bonded to a MOI. to create the structure T-L-MOI. Alternatively, the same structure is SnSSbJftS 
bonding a linker to a MOI. and then bonding the combination of linker and MOI to a tag. An example is where the MOI 
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is a DNA primer or oligonucleotide. In that case, the tag is typically first bonded to a linker, then the T-L is bonded to a 
ONA primer or oligonucleotide, which is then used, for example, in a sequencing reaction. 

[0158] One useful form in which a tag could be reversiWy attached to an MOI {e.g., an oligonucleotide or DNA 
sequencing primer) is through a chemically labile linker. One preferred design for the linker allows the linker to be 
cleaved when exposed to a volatile organic add. for example, trifluoroacetic acid (TFA). TFA in particular is compatftte 
with most methods of MS ionization, including electrospray. 

[01 591 As described in detail below, the invention provides a method for determining the sequence of a nucleic acid 
molecule. A composition which may be formed by the inventive method comprises a plurality of compounds of the for- 
mula: 

T^-L-MOI 



wherein T™ is an organic group detectable by mass spectrometry. T™ contains carbon, at least one of hydrogen 
and fluoride, and may contain optional atoms including oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula. 
L is an organic group which allows a T^-containing moiety to be cleaved from the remainder of the compound upon 
exposure of the compound to cleavage condition. The cleaved 7™N»ntaining moiety includes a functional group which 
supports a single ionized charge state when each of the plurality of compounds is subjected to mass spectrometry. The 
functional group may be a tertiary amine, quaternary amine or an organic acid. In the formula. MO I is a nucleic acid 
fragment which is conjugated to L via the 5* end of the MOI. The term "conjugated" means that there may be chemical 
groups intermediate L and the MOI. e.g., a phosphodiester group and/or an alkylene group. The nucleic acid fragment 
may have a sequence complementary to a portion of a vector, wherein the fragment is capable of priming nucleotide 
synthesis. 

[01601 In the composition, no two compounds have either the same T™ 3 or the same MOI. In other words, the com- 
position includes a plurality of compounds, wherein each compound has both a unique T™* and a unique nucleic acid 
fragment (unique in that it has a unique base sequence). In addition, the composition may be described as having a 
plurality of compounds wherein each compound is defined as having a unique T™. where the T™ is unique in that no 
other compound has a T™ 8 that provides the same signal by mass spectrometry. The composition therefore contains a 
plurality of compounds, each having a T™ 8 with a unique mass. The composition may also be described as having a 
plurality of compounds wherein each compound is defined as having a unique nucleic acid sequence. These nucleic 
acid sequences are intentionally unique so that each compound will serve as a primer for only one vector, when the 
composition is combined with vectors for nucleic acid sequencing. The set of compounds having unique Tms groups is 
the same set of compounds which has unique nucleic acid sequences. 

[01 61 1 Preferably, the T™ groups are unique in that there is at least a 2 amu. more preferably at least a 3 amu. and 
still more preferably at least a 4 amu mass separation between the T™ groups of any two different compounds. In the 
composition, there are at least 2 different compounds, preferably there are more than 2 different compounds, and more 
preferably there are more than 4 different compounds. The composition may contain 100 or more different compounds 
each compound having a unique T™ and a unique nucleic acid sequence. 

[01621 Another composition that is useful in, e.g., determining the sequence of a nucleic acid molecule, includes 
water and a compound of the formula T^-L-MOI. wherein T™ is an organic group detectable by mass spectrometry. 
T™ contains carbon, at least one of hydrogen and fluoride, and may contain optional atoms including oxygen, nitrogen! 
sulfur, phosphorus and iodine. In the formula, L is an organic group which allows a T m8 -containing moiety to be cleaved 
from the remainder of the compound upon exposure of the compound to cleavage condition. The cleaved T^-contain- 
ing moiety includes a functional group which supports a single ionized charge state when each of the plurality of com- 
pounds is subjected to mass spectrometry. The functional group may be a tertiary amine, quaternary amine or an 
organic acid. In the formula. MOI is a nucleic acid fragment attached at its 5* end. 

[0163] In addition to water, this composition may contain a buffer, in order to maintain the pH of the aqueous com- 
position within the range of about 5 to about 9. Furthermore, the composition may contain an enzyme, salts {such as 
MgCI 2 . and NaCI) and one of dATR dGTR dCTP. and dTTR A preferred composition contains water, T^-L-MOI and 
one (and only one) of ddATP, ddGTR ddCTR and ddTTR Such a composition is suitable for use in the dideoxy sequenc- 
ing method. 

[0164] The invention also provides a composition which contains a plurality of sets of compounds, wherein each set 
of compounds has the formula : 



T™-L-MO! 



wherein, 

[0165] T™ is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. L is an organic group which 



33 



EP 0 992 511 A1 



ment wherein L is conjugated to MOI at the MOI's 5' end. * aCld 9 " 



70 



2!!S JIT'" 3 S !L a " ""^^ hawB me Sa ™ group, and the MOI fragments have variable lengths that ter 
^ J* Sa T ied,deoxynucteolid8 ^ected from ddAMP. ddGMP. ddCMP andddTMP- L be^«S th!,^ 
= bya,.eas,2an W . pre »e^ bya , least 3 an . T , e ^^ 

[01671 m a preferred composition comprising a first plurality of sets as described above there is addition^ 
present a second plurality of sets of compounds having the formula 18 addltonaUy 



T^-L-MOI 



[0168] The invention also provides a kit for DNA seaueneinn onaiweie tk« i^* _ 

sets, where each container set includes at leasTf SmaiSZ ^cmp^ 3 pluraB * <* contei "« 

* fourth and fifth containers contain compouiSol TiEHS? 3 ^ SeCOnd * ,,W ' 

r™-L-MOI 



allows a T^-containing moiety to be deav^from ISS^S SlTc^Z^ w l LT^SE? ^ 

spectrometry and is selected from tertiary amine, quaternary amine and oraanic acid ^Z^TJfJ • ^ 
wherein L is conjugated to MOI at the MOI's 5' end The MOI for^e s^Tnd Sd S£2 ^ ^ C ** fragment 
35 and complementary to a portion of the vector JSr Mne s« tfc^^and I^ZZZ*^? " 

different from the other T 1 ™ groups in the kit containers, and the T™ group within each container is 

£5 JZZ3Z^£££ZT' ,S 3t ' eaSt 3 ' th6re " 31 ' eaSt ^ - S « ™ Mce pref- 

ments which are corrplementary to a JSSSSZ^IT.^ ££2 So^St^ ^ 
second terminus of a nucleic acid mo.ecule). wherein a tag is corrective ^TJS!^ S^TuJZd? 

« ecule. Each of the aspects will be discussed in n4e S^SST* determ,n,n9 * e Sequence °< •» ™*c add nd- 
B. SEQUENCING MFTHODS AND RTp Ajcra.p^ 

so ecu*\ief^ T^lT^ - qU ™ * 3 * C * ^ 

targe, nudeic aci2 molecu.e. In a p^£2£S£ ^t^^^^ 6 ^^^^ 
to a second terminus of a nucleic acid molecule, and more preferably from a * t^n IT,^! ^ rm,nUS 
ferred embodiments, the tagged fragments are oenerated frnm^ Ln^Ti to a 3 teriT1,nus - ,n ottler P'e- 

deotide terminators. A tag o?a tagg'ed ^ac^agmentT^ 

55 by spectrometry (induding fluorescence, but preferably other than ^«!J™5 „ k n " d ^ atuSe 31x1 ,s detectable 
embodiment, at least five tagged nudeic areola ^ a^arl t^ ? ,n a ferred 

ment. More speedy. the number of tagged iJS^SKX £JH s"K£ Th^^ 
ac* fragments may be generated from a variety of compounds, including those set forth abov^Twi^eS^e 
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in the art tat the methods of the present invention are not limited to use only of the representative compounds and com- 
positions described herein 

[0172] Following generation of tagged nucleic acid fragments, the tagged fragments are separated by sequential 
length. Such separation may be performed by a variety of techniques. In a preferred embodiment, separation is by liquid 
chromatography (LC) and particularly preferred is HPLC. Next, the tag is cleaved from the tagged fragment The partic- 
ular method for breaking a bond to release the tag is selected based upon the particular type of susceptibility of the 
bond to cleavage. For example, a light-sensitive bond (i.e. one that breaks by light) will be exposed to light The 
released tag is detected by spectrometry or potentiometry. Preferred detection means are mass spectrometry infrared 
spectrometry, ultraviolet spectrometry and potentiostatic amperometry (e.g.. with an amperometric detector or coule- 
metric detector). * 

[01731 « will be appreciated by one in the art that one or more of the steps may be automated eg by use of an 
instrument. In addition, the separation, cleavage and detection steps may be performed in a continuous manner (e a 
continuous flow/continuous fluid path of tagged fragments through separation to cleavage to tag detection) For exam" 
pie. the various steps may be incorporated into a system, such that the steps are performed in a continuous manner 
Such a system is typically in an instrument or combination of instruments format For example, tagged nucleic acid frag- 
ments that are separated (e.g.. by HPtC) may flow into a device for cleavage (e g., a photo-reactor) and then into a tag 
detector (e.g.. a mass spectrometer or coulometric or amperometric detector). Preferably, the device for cleavage is tun- 
able so that an optimum wavelength for the cleavage reaction can be selected. 

[0174] It will be apparent to one in the art that the methods of the present invention for nucleic acid sequencing may 
be performed for a variety of purposes. For example, such use of the present methods include primary sequence deter- 
mmaton for viral, bacterial, prokaryotic and eukaryotic (e.g.. mammalian) nucleic acid molecules; mutation detection- 
diagnostics: forensics; identity: and polymorphism detection. 

t. Sequencin g Methods 

[0175] As noted above, compounds including, those of the present invention may be utilized for a variety of 
sequencing methods, including both enzymatic and chemical degradation methods. Briefly, the enzymatic method 
described by Sanger (Proc. Natl. Acad. Sd. (USA) 74:5463. 1977) which utilizes dideoxy-terminators. involves the syn- 
thesis of a DNA strand from a single-stranded template by a DNA polymerase. The Sanger method of sequencing 
depends on the fact that that dideoxynucleotides (ddNTPs) are incorporated into the growing strand in the same wav a 
normal deoxy nucleotides (albert at a lower efficiency). However. ddNTPs differ from normal deoxy nucleotides (dNTPs) 
in that they lack the 3-OH group necessary tor chain elongation. When a ddNTP is incorporated into the DNA chain 
the absence the 3-hydroxy group prevents the formation of a new phosphodiester bond and the DNA fragment is ter- 
minated with the ddNTP complementary to the base in the template DNA. The Maxam and Gifoert method (Maxam and 
Gilbert, Proc. Natl. Acad. Sci. (USA) 74:560. 1977) employs a chemical degradation method of the original DNA (in 
both cases the DNA must be clonal). Both methods produce populations of fragments that begin from a particular point 
and terminate in every base that is found in the DNA fragment that is to be sequenced. The termination of each frag- 
ment is dependent on the location of a particular base within the original DNA fragment The DNA fragments are seo- 
arated by polyacrylamide gel electrophoresis and the order of the DNA bases (AC.T.G) is read from a autoradiooraoh 
of the gel. 

2. Exonuclease DNA Sequencing 

[0176] A procedure for determining DNA nucleotide sequences was reported by Labert et al (S Labeit H Lebrach 
& R. S. Goody. DNA 5: 173-7. 1986; A new method of DNA sequencing using deoxynucleoside alpha-thiotriphos- 
phates). In the first step of the method, four DNAs, each separately substituted with a different deoxynucleoside phos- 
phorothioate in place of the corresponding monophosphate, are prepared by template-directed polymerization 
catalyzed by DNA polymerase. In the second step, these DNAs are subjected to stringent exonuclease III treatment 
which produces only fragments terminating with a phosphorothioate internucleotide linkage. These can then be sepa- 
rated by standard get electrophoresis techniques and the sequence can be read directly as in presently used sequenc- 
ing methods. Porter et al. (K. W. Porter. J. Tomasz. F. Huang. A. Sood & B. R. Shaw. Biochemistry 34- 1 1963-1 1969 
1995; N7-cyanoborane-2'-deoxyguanosine SMriphosphate is a good substrate for DNA polymerase) described a nevv 
set of boron-substituted nucleotide analogs which are also exonuclease resistant and good substrates for a number of 
polymerases: these base are also suitable for exonuclease DNA sequencing. 

3. A Simplified Strategy for Sequen cing Large Numbers of Full Length cDNAs. 

[0177] cDNA sequencing has been suggested as an alternative to generating the complete human genomic 
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sequence. Two approaches have been attenpted. The first involves generation of expressed sequence tags (ESTs) 
througha single DNA sequence pass at one end of each cDNA done. This method has given inserts into the d Stw 
hon of types of expressed sequences and has revealed occasional useful homology with genomic Cents btf otSl 
has added little to our knowledge base since insufficient data from each clone ^^^s^^^Tto 
generate conplete cDNA sequence which can indicate the possible function of the cDNAs. UrifortonatefyVnostcDN/S 
mL ^^T^ Z hinders the automation of fulMength sequence deteS^^emHe 

™* £ ar9 l SCa ' e - hiQh throu 9 h P ut se < uenc9 P'°*«*°" ^ "rom sequencing from a vectoS? 

s.te. which typically y.elds less than 500 bases of sequence from each flank. The synthesis of new diqomx^tideDnZ 
ers of length ,5-18 bases tor -primer walking- can allow closure of each sequenced anernativT^S^to^Zn 
cDNA sequencing .s to generate modified templates that are suitable for sequencing with a universal prima btforo^foe 
overlapping coverage of the molecules. primer, out provae 



J01 78] Shotgun sequencing methods can be applied to cDNA sequencing studies by preparino a seoarate librarv 
fron eachcDNA clone. These methods have not been used extensively for the analvsisVtn^ J olSsS 

applied to projects where the target sequence is of the order of 1 5 to 40 kilobases. such as in lambda or Lrnid insert 
4- Analoov of cDNA with Genomic Seg ntynrin^ 

. I , 01 j[ 91 . °® sprte 1,19 typicaMy different size of the individual clones to be analyzed in cONA sequencing there are sim- 
ilarities with the requirements for large scale genomic DNA sequencing. In addition to a low S 1^ base an^a hi£ 
throughput the idea, strategy for full length cDNA sequencing wil, have a high accural ^Z^^'JZ JT 
ology for genomic DNA sequencing involves the preparation of shotgun sequencing libraries from cZZLm^rtt 
random sequencing using AB, fluorescent DNA sequencing instrument and closure (1^1^)^^^^ 
Overall there is agreement that toe ^orescent shotgun approach is superior to curreni 2£2£ SE£ 71 
aency and accuracy. The ratal shotgun library quality is a critical determinant of the ease and oualttv ^!ZmL 
assembly. The high quality of toe available shotgun library procedure has prompted a strategy for mif 
tiplex shotgun braries containing mixtures of the smaller cDNA clones. Here^ividualltolSs to bSSHSH 

Zn^lSl ^^TfT ^ T aentified fo,,0win9 random sequencing, at toe stage of conW^naW 

Junctons between individual clones are labeled during library production either by PCR or by identification of 

arm sequence. ^ au ^"' Ul vector 

[0180] Clones may be prepared both by microbial methods or by PCR. When using PCR. three reactionsfrom each 
clone are used in order to minimize toe risk for errors. eacnuns irom eacn 

10181] One pass sequencing is a new technique designed to speed toe identification of important sequences within 
a ™ ^ ,0 " * gnomic DNA. Briefly, a high quality shotgun library is prepared and then SSSS ££££ 
obtain 80 - 95% coverage. For a cosmid this would typically be about 200 samples. Essentially aB geneVarelSSv to 
scTeen?r5 " or Jon st^re (gS^ 

101821 "Skimming" has been successfully applied to cosmids and P1s. One pass sequencing is ootentiallv the fast 
est and least expensive way to find genes in a positional cloning project. The outcome * ^ SEEK toS 
t.gators are currently developing cosmid contigs tor axon trapping and related techniques. Cosmtos aTe compel 
surtable tor sequence skimming. P1 and other BACs could be considerably cheaper since there Jsavin« bo^^ 
gun library construction and minimization of overlaps. savings both in shot- 

5. Shotaun Sequencing 

[0183] Shotgun DNA sequencing starts with random fragmentation of the target DNA. Random seouencino « th~, 

s^>ir rate ,r: maiority of da r A directed phase comp,etes » ^^3^" 

boto direcbons. Shotgun sequencing offers the advantage of high accuracy at relatively low cost The procedure sbeS 

ESS ^ ^ ^ y ^ fra9mentS 3nd iS me methQd 01 choice in ,ar 9 9 50319 aenomic SHEES? 
[0184] There are several factors that are important in making shotgun sequencing accurate and cost SeSve A 
major consideration is the quality of the shotgun library that is generated, since any Cones that do not h^e Si 
have chimeric mserts. will result in subsequent inefffcient sequencing. Another consideration is the c^eW bTncS of 
the random and the d,rected phases of toe sequencing, so that high accuracy is obtained with nSfoTanE EE? 
ciency through unnecessary sequencing. oss OT eTTH 

6. Sequencing Chemistry: Tagged -Terminator Chpmistry 

[0185] There are two types of fluorescent sequencing chemistries currently available: dye primer, where the primer 
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is flu<^^y labeled, and dye terminator, where the dideoxy terminators are labeled. Each of these chemistries can 
be used w.th either Taq DMA polymerase or sequenase enzymes. Sequenase enzyme seems to read easily throuah G- 
C rich regions, palindromes, simple repeats and other difficult to read sequences. Sequenase is also good for seouenc- 
<ng mixed populations. Sequenase sequencing requires 5 ug of template, one extension and a multi-step cleanup proc- 
ess. Tagged-pnmer sequencing requires four separate reactions, one for each of A. C. G and T and then a laborious 
cleanup protocol. Taq terminator cycle sequencing chemistry is the most robust sequencing method With this method 
any sequencing primer can be used. The amount of template needed is relatively small and the whole reaction process 
from setup to cleanup is reasonably easy, compared to sequenase and dye primer chemistries. Only 1 5 ug of DNA 
template and 4 pm of primer are needed. To this a ready reaction mix is added. This mix consists of buffer enzyme 
J ,ab !, ,ed dideOXymic,eotidea This reactio " «" be done in one tube as each of the four dideoxies 'is labeled 
wrt, a drfferent fluorescent dye These labeled terminators are present in this mix in excess because they are difficult 

Zl n ^°Z 9 eXt , enSK3n - With unclean DNA incorporation of these high molecular weight dideoxies can be 
-nhrfxted. The prem.x .ncludes dITP to minimize band compression. The use of Taq as the DNA polymerase allows the 
reactions to be run at high temperatures to minimize secondary structure problems as well as non-specific primer bind- 
ing. The whole cocktail goes through 25 cycles of denaturation. annealing and extension in a meTmaJ cycler and ™e 
completed reaction .s spun through a Sephadex Q50 (Pharmacia. Piscataway. NJ) column and is ready for gel loading 
alter five minutes in a vacuum dessicator. s loaning 

7. Designing Primers 

[0186] When designing primers, the same criteria should be used as for designing PCR primere. In particular prim- 
ers shouW preferatty be 1 8 to 20 nucleotides long and the 3-prime and base should be a Q or a C. PfimnSSuZo 
preferably have a Tm of more than 50°C. Primers shorter than 18 nucleotides will work but are not recornmended Tto 
shorter the pnmer the greater the probability of it binding at more than one site on the template DNA. and the tower te 
Tm. The sequence should have 100% match with the template. Any mismatch, especially towards the 3-prime end wS 
greatly oirninish sequencing ability. However primers with 5-prime tails can be used as long as there is about 1 8 bas« 
at 3-pnme that bind. If one is designing a primer from a sequence chromatogram. an area with high confidence must 
be used. As one moves out past 350 to 400 bases on a standard chromatogram. the peaks get broader and toe base 
calls are not asaccurate. As described herein, the primer may possess a 5' handle through which a linker or linker tag 

8. Nucleic Acid Template Preoaratinn 

[0187] The most important factor in tagged-primer DNA sequencing is the quality of the template Briefly one com 
mon misconception is that if a template works in manual sequencing. Jshoufo^nVin autcSsecSnTto^ 
if a reaction works in manual sequencing it may work in automated sequencing, however, automated sequtiSngt 
much more sensitive and a poor quality template may result in GttJe or no data when fluorescent sequencing mSSte 
are utilized. High salt concentrations and other cell material not properly extracted during template preparation tocfod- 
^ *T?/ " keW,S6 prevent tha ****** 10 obtain a «™*e sequence information. Many mini and maxi prep protoaJs 
produce DNA which is good enough for manual sequencing or PCR. but not for automated (tagged-primer) seauendm 
Also the use of phenol is not at all recommended as phenol can intercalate in the helix structure^™* STtS 
roform .s suff.cent There are a number of DNA preparation methods which are particularly preferred tor the taoaed 
pnmer sequerwng methods provided herein. In particular, maxi preps which utilize cesium chloride preparations or Qia- 
gen (Chatsworto. CA) max, prep columns (being careful not to overload) are preferred. For mini preps, columns such 
as Promegas Magic Mm, prep (Madison. Wl). may be utilized. When sequencing DNA fragments such as ™R frat 
ments or restnct.on cut fragments, it is generally preferred to cut the desired fragment from a low melt argarose gel and 
then purrfy wrth a product such as GeneClean (La Jolla. CA). It is very important to make sure that only one bandTs cut 
from the get For PCR fragments the PCR primers or interna, primers can be used in order to ensur^ 
ate fragment was sequenced. To get optimum performance from the sequence analysis software, fragments should The 
larger than 200 bases. Double stranded or single stranded DNA can be sequenced by this method 
[0188] An add,tional factor generally taken into account when preparing DNA for sequencing is the choice of host 
strain. Compan.es sell.ng equipment and reagents for sequencing, such as ABI (Foster City. CA) and Qiaoen fChate 
worth. CA). typically recommend prefened host strains, and have previously recommended strains such as DH5 a lnh! 
HB101. XL-1 Blue. JM109. MV1 19 0. Even when the DNA preparations are very clean. tfSe «e o^herert ffaSoS 
which can make it diff icutt to obtain sequence. G-C rich templates are a«ways diff icult to sequel 3 
ary structure can also cause problems. Sequencing through a long repeats often proves to be difficult For instanceTs 
Taq moves along a poly T stretch, the enzyme often falls off the template and jumps back on again, skipping a T This 
results ,n extens,on products with X amount of Ts in the poly T stretch and fragments with X-1. X-2 etc amounts of Ts 
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IT^. ^ - *- » ■- — *» - *P— in each posifcn ^ „. seqiwnc . ^ 
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[0189J Sequencing may also be accomplished utilizing universal clonina vector /M131 an n 
sea^enang primers. Brief* tor present cloning vectors the sJ^^'S So^^TS 
errptoyed (each tag is a different fluorophore which represents a Meant tt3~l! ,!L«£L only 4 tags are 
process must take place in different corners ^^ ^^1 ^ every amplification 
« different DNA samples in the same amplification P^2^2T2m^ « ZT^° " 
tal 9 n T ^ n ° COnVen ' ent means to ^econvolute ?£2£X oTrTe ^^0^^^ oT^ 

(01901 A substantial advantage is gained when multiples of 4 tags can be run per ael lane nr rf «*wH„ a 
' 5 P«~- ln Pa^ar. *M*Q tags of the present invention, more than oneZw^Z Ta Z^Z^ ^ 

this advantage a senes of vectors must be generated in which a unique priming site is ireertedA 'unST^^S 
o .s s.mply a stretch of 18 nucleotides which differs from vector to vector The remSn^u^^^J^ 9 
served from vector to vector. A sequendng primer is nreoamd r~n*««£ JTUT 9 nudeot,de sequence is con- 
Each unique primer is derived (^^^^10X^2^ corresponds to each unique vector. 

Err^ * present invention to process rmi- 

» tors. For example, if Too unique vecloTs arTavaST ISfiZZZ ^SETS^?"" *° * VeC " 

performed. Second, one sample from each veX^e ,s rS^SlT^ ^ "* piddns 01 P"** 88 are 
unique DNA fragments or samples. A aSoCJSSSSS 2 ^ VeCt0,S COnteinin 9 100 
with the associated tag set Z -spec^r^ ^ " 

to the reaction container and the amplification process JmSZ ™! S^S^Z. I oofactorc are **ed 
and the respective sequence is es^Sed SnTtS eS an^n^T Z^l*?"* * * se P arato " 
samples has substantia, advantages. TheTeag^^^^^^^f J^^'J; to DNA 
method described herein the cost of amplif jcatioVonToer mE5m 2 ™ ^^ $ ,?° ^ Wrth tne 
Sample handling could be reduced by a factoTot at°^ 1 on ^ ^ * ,eaSt ^ a factor of 100. 

scale amplification robots would be ooviateT ' atena ' S ^ C0U,d ^ reduced - The need large 

10. Sequencing Vectors far Clesvabte Mass .Cp ^ ctros^ny Tannin,, 

primer is used tor each doning vector- each reart^hL ^> h « . Proceeds. In CMST sequencing, a different 
Primer correspor^ to one of "-e^™ 

performed on each pooled DNA sample (one tor each base), so every vector has tour otaESt Smlnfl^ "* 
.dentical ,n sequence but tagged with a different CMS tag. The four separate seaue^n?JT e ^^ pr,mer f'f acn one 
together. When 20 samples are pooled 80 tans ar« , ^ Z sequencing reactions are pooled and run 

detected simultaneous^ the geTfe rfn. <4 159863 Per Samp ' e t,meS 20 and all 80 are 

S T> Tl C JT trU 2 i0n 01 1,16 VeCt0rS may be accom P«shed by cloning a random 20-mer on either side of a restric 
tion srte. The resulting dones are sequenced and a number chosen for use as vectors Two oVta^!^- V 
pared for each vector chosen, one homologous to the seauence at M rh 1,^ ^.1 oligonucleotides are pre- 
so that the 3 -end is towards toe restriction^ ^^S^d^^^J^^SlT" ""A"* 
base in the sequencing reactions and each one bbeled SSVSSSSl tog P ™ *" 

1 1. AqVgntariBS Of SeniiPncina bv tha Use of Ra^ fHff x ny . 

[0194] There are substantial advantages when cleavahia tans aro , 

obtain read lengths greater than 450 nt nuc| oot.des (nt). for example). This w.11 greatly .mpact the ability to 
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[0195] The use of deavaWe multiple tags (MW identifiers) also has the advantage that multiple ONA samples can 
be run on a single gel lane or separation process. For example, it is possfole using the methodologies disclosed herein 
to combine at least 96 samples and 4 sequencing reactions (A.G.T.C) on a single lane or fragment sizing process. If 
multiple vectors are employed which possess unique priming sites, ten at least 384 samples can be combined per gel 
lane (the different terminator reactions cannot be amplified together with this scheme). When the ability to employ 
cleavable tags is combined with the ability to use multiple vectors, an apparent 10.000-ibld increase in DNA sequencing 
thoughput is achieved. Also, in the schemes described herein, reagent use is decreased, disposables decrease, with a 
resultant decrease in operating costs to the consumer. 

[01 96J An additional advantage is gained from the ability to process internal controls throughout the entire method- 
ologies described here. For any set of samples, an internal control nudeic acid can be placed in the sample(s). This is 
not possible with the current configurations. This advantage permits the control of the amplification process, the sepa- 
ration process, the tag detection system and sequence assembly. This is an immense advantage over current systems 
in which the controls are always separated from the samples in all steps, 

[0197] The compositions and methods described herein also have the advantage that they are modular in nature 
and can be fitted on any type of separation process or method and in addition, can be fitted onto any type of detection 
system as improvements are made in either types of respective technologies. For example, the methodologies 
described herein can be coaled with "bundled" CE arrays or microfabricated devices that enable separation of DNA 
fragments. 

« 

C. SEPARATION OF DNA FRAGMENTS 

[0198J A sample that requires analysis is often a mixture of many components in a complex matrix For samples 
containing unknown compounds, the components must be separated from each other so that each individual compo- 
nent can be identified by other analytical methods. The separation properties of the components in a mixture are con- 
stant under constant conditions, and therefore once determined they can be used to identify and quantify each of the 
components. Such procedures are typical in chromatographic and electrophoretic analytical separations. 

1- Hiah-Performance Liquid Chromatography ( HPLC) 

[0199] High-Performance liquid chromatography (HPLC) is a chromatographic separations technique to separate 
compounds that are dissolved in solution. HPLC instruments consist of a reservoir of mobile phase, a pump, an injector 
a separation column, and a detector. Compounds are separated by injecting an aliquot of the sample mixture onto the 
column. The different components in the mixture pass through the column at different rates due to differences in their 
partitioning behavior between the mobile liquid phase and the stationary phase. 

[0200] Recently. IP-RO-HPLC on non-porous PS/DVB particles with chemically bonded alkyl chains have been 
shown to be rapid alternatives to capillary electrophoresis in the analysis of both single and double-strand nucleic acids 
providing similair degrees of resolution (Huber et al, 1993. Anal.Biochem., 212. p351; Huber et ah. 1993. Nuc. Acids 
Res.. 21. p1061; Huber et al.. 1993. Biotechniques, 16. p898). In contrast to ion-excahnge chromoatrography. which 
does not always retain double-strand DNA as a function of strand length (Since AT base pairs intereact with the posi- 
tively charged stationary phase, more strongly than GC base-pairs). IP-RP-HPLC enables a strictly size-dependent 
separation. 

[0201 ] A method has been developed using 1 00 mM triethytammonium acetate as ion-pairing reagent phosphodi- 
ester oligonudeotides could be successfully separated on alkylated non-porous 2.3 \iM poly(styrene-divinylbenzene) 
partides by means of high performance liquid chromatography (Oefner et al., 1994. Anal. Biochem., 223 p39) The 
technique described allowed the separation of PGR products differing only 4 to 8 base pairs in length within a size range 
of 50 to 200 nucledtides. 

2- Electrophoresis 

[0202] Electrophoresis is a separations technique that is based on the mobility of ions (or DNA as is the case 
described herein) in an electric field. Negatively charged DNA charged migrate towards a positive electrode and posi- 
tively-charged ions migrate toward a negative electrode. For safety reasons one electrode is usually at ground and the 
other is biased positively or negatively. Charged species have different migration rates depending on their total charge 
size, and shape, and can therefore be separated. An electrode apparatus consists of a high-voltage power supply, elec- 
trodes, buffer, and a support for the buffer such as a polyacryiamide gel. or a capillary tube. Open capillary tubes are 
used for many types of samples and the other gel supports are usually used for biological samples such as protein mix- 
tures or DNA fragments. 
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3. Capillary Electrophoresis (CE) 

[02031 Capillary electrophoresis (CE) in its various manifestations (free solution, isotachophoresis. isoelectric 
Ibcuang. po.yacry.am.de gel. micellar electrokinetic -chromatography") is developing as a 

JIS^ . . ^potential powerful technique for bioanalysis. In the novel application disclosed herein the 
mtenacng of these two methods wil. lead to superior DNA sequencing methods that ec£se the cuSn^te m^c2 
of sequencing by several orders of magnitude. n rara menioas 

. ff^L , ™ e despondence between CE and electrospray ionization (ESI) flow rates and the fact that both are fad 

2S2£1 P ST* T* ^ ^ SP6d8S so,ution I""* ^ an extremely Active contoSon £e 
ZTl 'ZTJ^ S5? eleCtr ° pfl0reas < CZE > and capiflary isotachophoresis wrth qiiadrW^so* 
2T2T?c2?? ^ h3Ve beef1 described ( 0,iv a^ et al.. Anaf. Cnem. 59:1230. 1987- SmtoetaT 

umn Sep. f 223. 1989. Lee et al. J. C/woma/pg. 458:313. 1988; Smith et al.. J. Chromatin 1989- rJ^L 

' ' ' »«* 1969 >- Sma " »• -* amenable to CZE ^Z^^T^ 

[02051 The most powerful separation method tor DNA fragments is polyacrylamide ael eleclronh««*i* tPim 
gene^ly in a slab gel forma,. However, the major limitation of the current £2S£!£ S£2SZEZ reS 
to perform the gel electrophoresis of DNA fragments produced in the sequencing reactons JStSSSSST!^ 

charge^ .^amide gete. DNA fra 9ments steve arrinrigrate^ as SSS ^ZXSZ££ZZ£ 

»rp*m^ 

be employed tor DNA sequencing. The method of £ ist pn^e^rnes^ ™ 
standard sequencer. For example, about 300 bases can be read per hour The seoaratton 1 
elec^ophoresis by the magnitude of toe electric field which can be £££ ^*3£ ^SSS^^SSS 
Therefore toe greater speed of CE is achieved through the use of higher field i«o^wf3S?SS lEST 0 
gene^oT "* -actoph ~»- ^ *"« the amperage and K anTtoe resuSeS 

onsjated h,gh through-put sequencing (Huang et al.. Ana/. Chem. 64:967, 1992. Huang et^ AnT^em XSS" 
1 992). The major d,sadvantage of capillary electrophoresis is the limited amount of samp.e » toat ca^eSed^ S 
cap.llary. By concerting a large amount of sample at the beginning of the capillary pdor to se^ratioT^riS^ 
.ncreased. and detection levels can be lowered several orders* magnitude The mo^Dc^Sa^^J ™ ^ S 
tration to CE is sample stacking. Sample stacking has recently been review^ SenTndTrgT SEJSSTSEE 
1992). Sample stacking depends of toe matrix difference. (pH. ionic strength) betweentoe &J«Zt*~2JZ * 
illary buffer, so that the electric field across the sample zone is rnore^Ttoe^Cr^STln ^lel^a 
large volume of sample in a low concentration buffer is introduced tor preccweirtratton^ ltth« hl'Ji ^^1. 
umn. The capillary is filled with a buffer of the same composiSi Z^^^^^^SIT cap..lary col- 
reach toe capillary buffer and the lower electric fieto.7ey SS^K at^StewTSS 
increased detectabilHies 1 -3 orders of magnitude. Jsampie stacking has 

[02071 Another method of preconcentraticn is to apply isotachophoresis (UP) prior to the free zone CE senary™, 
of analytes. .TP is an electrophone technique which allows microliter volumes of sample to be toa^ on t me^L? 
lary. in contrast to toe low nL injection volumes typically associated with CE. The tech^^eH^intSna TjSt' 
Pte between two buffers (leading and trailing electrolytes) of higher and lower fn±m^S^T^^S^ 
The technique is .nherently a concentration technique, where the analytes concentrate irrto pure 2,^1!^ *£t 
thesame speed. The technique is current.y less popular than the stacking methods desSb* ab^beSSTSl I« 

[0208] The heart of the DNA sequencing process is the remarkably selective electroohoretic separation m nw* 
o..gonucleotide fragments. It is remarkable because each fragment is resolved andSSSS ^SoSe Seo^« 
tons of up to 1000 fragments (1000 bp) have been obtained. A further advantage of ZSZTd^e Sl' 

fjf^ 1 "° r , 6qU,rement t0 ^ a ae » to ' «« when DNA fragments are sXSb^ya^m^faS 
elec^ophoresis when cleavable tags are employed. Since numerous samples are JSEZ HA i^SSSStSl nt 
need to run samples ,n parallel as is the case with current dye-primer or dye-terminator metoccte (/ «T ABK373 
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sequencer). Since there is no reason to run parallel lanes, there is no reason to use a slab gel. Therefore one can 
employ a tube gel format far the electrophoretic separation method. Grossman (Grossman et al.. Genet Anal Tech 
AppL 9 9, 1 992) have shown that considerable advantage is gained when a tube gel format is used in place of a slab 
gel format This is due to the greater ability to dissipate Joule heat in a tube format compared to a slab gel which results 
m faster run times (by 50%). and much higher resolution of high molecular weight DNA fragments (greater than 1000 
nt). Long reads are critical in genomic sequencing. Therefore, the use of cfeavable tags in sequencing has the addi- 
tional advantage of allowing the user to employ the most efficient and sensitive DNA separation method which also as- 
sesses the highest resolution. 



4. Mi 



Mated Devices 



[0209J Capillary electrophoresis (CE) is a powerful method for DIMA sequencing, forensic analysis. PCR product 
analysis and restriction fragment sizing. CE is far faster than traditional slab PAGE since with capillary gels a far hiqher 
potential field can be applied. However. CE has the drawback of allowing only one sample to be processed per gel The 
method combines the raster separations times of CE with the ability to analyze multiple samples in parallel The under- 
lying concept behind the use of microfabricated devices is the ability to increase the information density in electrophore- 
sis by miniaturizing the lane dimension to about 100 micrometera. The electronics industry routinely uses 
mtcrofabncatjon to make circuits with features of less than one micron in size. The current density of capillary arrays is 
imited the outside diameter of the capillary tube. Microfebrication of channels produces a higher density of arrays 
Microfabncatjon also permits physical assemblies not possible with glass ffeers and links the channels directly to other 
devices on a chip. Few devices have been constructed on microchips for separation technologies. A gas chromatograph 

aTZT ™ff f* I*?™ DeViCe ' e °- 26:188 °- 1979 > a«l * liquid chromatograph (Manz et al.. Sens. ActuaZs 
B 1 249. 1 990) have been fabricated on silicon chips, but these devices have not been widely used. Several groups have 
f ^o« S ^ 3t jy "uorescent dyes and amino acids on microfabricated devices (Manz et al.. J. Chromatography 
593:253. 1992. Effenhauser et al.. Anal. Chem. 55:2637. 1993). Recently WooHey and Mathies (Wodley and Ivlathies 
Proa NatLAcad. Sci. 91 :11348. 1994) have shown that photolithography and chemical etching can be used to make 
large numbers of separation channels on glass substrates. The channels are filled with hydroxyethyl cellulose (HEC) 
separation matrices, ft was shown that DNA restriction fragments could be separated in as little as two minutes. 

D. CLEAVAGE OF TAGS 

[021 0] As described above, different linker designs will confer cleavability riabiBty") under different specific physical 
or chemical conditions. Examples of conditions which serve to cleave various designs of linker include acid base oxi- 
dation, reduction, fluoride, thiol exchange, photolysis, and enzymatic conditions. 

general criteria for linkers listed above will be well known to 
those in the art and include those found in the catalog available from Pierce (Rockford. IL). Examples include: 

ethylene glycobis(succinimidvlsuccinate) (EGS), an amine reactive cross-linking reagent which is cleavable bv 
hydroxylamine (1 M at 37°C for 3-6 hours); 1 

disuccinimidyl tartarate (DST) and sulfo-DST. which are amine reactive cross-linking reagents, cleavable bv o 015 
M sodium periodate; 

bis[2-(sucdnimfoyloxycarbonylc3xy)ethyl]surfone (BSOCOES) and sulfo-BSOCOES. which are amine reactive 
cross-linking reagents, cleavable by base (pH 1 1 .6); ~ " 

1.4^i-[3-(2^pyridvfoithio(propionamido))butane (DPDPB). a pyridyUithiol crosslinker which is cleavable bv thiol 
exchange or reduction; ' 

N-t^p-azktosalicyl^^ (APDP) a wnjcn j_ 

cleavable by thiol exchange or reduction; 

rSuction 4 " (Ki£fc ^ icy,amido)etny,J " disulf We ' a Photoreactive crosslinker which is cleavable by thiol exchange or 

N-succinimidyl-(4-azidophenyl)-l.3 < dithiopropionate (SADP). a photoreactive crosslinker which is cleavable by thiol 
exchange or reduction; 7 

sulfosucdniniidyl-2-(7-azido-4-memvlcoun^ (SAED) a photoreactive 

crosslinker which is cleavable by thiol exchange or reduction; 

sulfosucdnimidyl^^m-azfooKJ-nitrobenzaznidoJ-ethyl-l.a'dithiopropionate (SAND), a photoreactive crosslinker 
which is cleavable by thiol exchange or reduction. 

[021 2) Other examples of cleavable linkers and the cleavage conditions that can be used to release tags are as fol- 
lows. A silyl linking group can be cleaved by fluoride or under acidic conditions. A 3-. 4-. 5-. or 6-substrtuted-2-nitroben- 
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zyloxy or 2-. 3-. 5-. or 6-substituted-4-nitroben Z ylQxy linking group can be cleaved by a photon source (photolysis) A 3- 

4- 5-. or S-substituted-2-altoxyphenoxy or 2-. 3-. 5-. or 6-substttited-4-a.l^ 
by Ce(NH 4te (N03) 6 (oxidation). A NCO, (urethane) linker can be cleaved t^rM^^^^I^S 
Jon) A 3-pentenyf. 2-butenyl. or 1-butenyt linking group can be cleaved by 0 3 . OsOVKV. or l^rOWo^dation A? 
[3-. 4-. or 5-substituted-fur y l]oxy linking group can be cleaved by Br 2 . MeOH. or acid 4 (0MlatOn) - A *' 

31 J^*j** **** deava 9 e <* other 'ab«e linking groups include: t-alkyloxy linking groups can be cleaved bv 
S : ^ ett ? ( 2 alky,)m6lhOXy Z t subs «^ed-2-aJkyl-1.3<lioxlane-2-y. linking groups c£Lte^% £"*t 
J«TZl Z g 9 ^Tl deaV6d * flU0ride 0f acid: 2-00-ethoxy (where X - keto. ester amide, ^no! 
sulfide sulfox.de. sulfone) Lntang groups can be cleaved under alkaline conditions; 2-. 3-. 4-. 5-. or 6-sub2we^be£ 
2?5 p, p^^T^i! be cleaved by acid or under reductive conditions; 2-butenytoxy linking groups ca^eawS 
if ^ ( w ' ■ ' ^ W 6 -^s«ut«*-2-broroophenoxy linking groups can be deaved by ^.T&L me^ 
yim^metfioxy Unking groups can be cleaved by Hg*-; 2-(X>etny1oxy (where X = a hatogenMnWn?grouM TcTbe 
f™** «>y Zn or Mg; 2-hydroxyethyloxy linking groups can be cleaved by oxidation ( e.gZ*L PbToA?) j 

" n ^ rs are those are cleaved by acid or photolysis. Several of the acid-labile linkers that have 
rZlZZ 0 * 3 J° ^, WM9 Synthesis are ^ linking tags to MOIa Some of ftesHntensa^ 

descnbed in a recent review by Lloyd-Williams etal. (TefraheoVon 43:11065-11 133. 1993). One useful ^eoftSe^ 

SSJSH r Sr y ^ C ° h0b ' 01 ^ ^"ydroxymetny t henoxyac«ic acid and wCZ 3 
uTS^"^*"" ac,d ; « «™eraally available from Advanced ChemTech (Uuisvilte. I^.^Z^can 
£ , « 9 t T J"^ 98 10 th9 benzylalcohol. and to an amine-containing MOI via an amide S 
the ca*oxyte acri. Tags l.nked by these molecules are released from the MOI with varyfng concentration J EX£ 

attached through related linkers, such as 2.4^im 6 thoxy-4^(cartx 3 xymethyloxy)^en z hydry^am 1 ne (availaSr^m 

a caiboxylic amide on the released taa 
1021 5] The photolabile linkers useful for this application have also been for the most part develooed for soiw nhaco 
peptide synthesis (see Uoyd-Wi.liams review). These linkers are usual* based on 2^2f W «iS 
'amrfes. Two ^examples of photolabile linkers that have recently been reported in the literature ar "ZfSlZ. 
am,no)ethyl)-2-methoxy-5-n,trophenoxy,butanoic acid (Holmes and Jones. J. Org. Chem. 6Y723 18-231 9 laS u£l 
(Fmoc-amino)-3-(2-nitrophenyl)propionic acid (Brown et ai.. Molecular Diversity 14-12 1995) Both Mm. 

am.de between a carooxylic acid on the tag and the amine on the linker. Cleavage of photolabile linkers fe^ fT 
formed wrth UV Hghtof 350 nm wavelength at intensities and times known to those in 5£SSS5 2 cSeSai 
sources of .nstruments for photochemical cleavage are Aura Industries Inc. (Staten Island. NY^ZZS^Z^ 
ington, MA). Cleavage of the linkers results in .iberation of a primary amide on the tag. SSnpies SphXES^ 
ers include nrtrophenyl glycine esters, exo- and erxto-2-benzonorborneyl chlorides w^^T^S^t^t 
a^3(2-niticpheny.) propionic add. Exanp.es of enzymatic cleavage include esterases^ TSESfZi 
bonds, nucleases wh,ch will cleave phosphodiester bonds, proteases which cleave peptide tonZetc 

E. DETECTION OF TAGS 

[021 6] Detection methods typically rely on the absorption and emission in some type of spectral field When atoms 
^molecules abso* light, the incoming energy excites a quantized structure to a highl enen^Sve. The o?S 
tat.cn depends on the wavelength of the light. Electrons are promoted to higher oLals by ultravtoU or vELe fa£ 
molecular vibrations are excited by infrared light, and rotations are excited by rrfcrowaves . AnSn£ 
me absorption of light as a function of wavelength. The spectrum of an atom or molecule O^S^S^ZZ 
structure Absorption spectra are useful for identification of compounds. Specific absorption specific method 
include atomic absorption spectroscopy (AA). infrared spectroscopy (IR). and UV-vis spectiosc^vZS 

° r 6XCited to hf 9 h levels can decay to lower leXby emitting radiation 

This tight em.ss.on ,s called fluorescence if the transition is between states of the same spin, and phospho?es7enS"f 
the transition occurs between states of different spin. The emission intensity of an analyte telinearly pm££!EE5£ 

mtSr ^ T C ° nCentrat0nS, ■ ^ iS USefU ' for qUanttyin 9 *° emrttin 9 secies. ^Specific eniiic^ecfoSopfo 
methods ; include atomic em.ss.on spectroscopy (AES). atomic fluorescence spectroscopy (AFS) molec^Sr 
induced fluorescence (LIF), and X-ray fluorescence (XRF). moiecuiar laser- 

[0218] When electromagnetic radiation passes through matter, most of the radiation continues in its original direc- 
tion but a small fraction is scattered in ether directions. Light that is scattered at the same wavelength aL SSSlt 
hghi is called Rayleigh scattering. Light that is scattered in transparent solids due to t*£T52£l3 

^l? T n l : tou,n scat ! er,n , 9 is typica,,y shifted by 01 to 1 wave number from ^e wdJEiBhTSn! SZ t 

scattered due to v.brations in molecules or optical phonons in opaque solids is called Raman scattering Raman ^Lf 
tered light ,s shifted by as much as 4000 wavenumbers from the incident light Specific scattering specLcWmSh 
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ods include Raman spectroscopy. 

[021 9] I R spectroscopy is the measurement ot the wavelength and intensity of the absorption of mid-infrared light 
by a sample. Mid-infrared light (2.5 - 50 nm. 4000 - 200 cm 1 ) is energetic enough to excite molecular vibrations to 
h.gher energy levels. The wavelength of IR absorption bands are characteristic of specific types of chemical bonds and 
IR spectroscopy is generally most useful for identification of organic and organometallic molecules 
[02201 Near-infrared absorption spectroscopy (NIR) is the measurement of the wavelength and intensity of the 
absorption of near-infrared light by a sample. Near-infrared light spans the 800 nm - 2.5 M m (12,500 - 4000 cm" 1 ) range 
and is energetic enough to excite overtones and combinations of molecular vibrations to higher energy levels NIR 
spectroscopy is typically used for quantitative measurement of oiganic functional groups, especially O-H N-H and 
C=0. The components and design of NIR instrumentation are similar to uv-vis absorption spectrometers. The liaht 
source is usually a tungsten lamp and the detector is usually a PbS solid-state detector. Sample holders can be glass 
or quartz and typical solvents are CCI 4 and CSj. The convenient instrumentation of NIR spectroscopy makes it suitable 
for on-line monitoring and process control. 

[02211 Ultraviolet and Visible Absorption Spectroscopy (uv-vis) spectroscopy is the measurement of the wave- 
length and intensity of absorption of near-ultraviolet and visible light by a sample. Absorption in the vacuum UV occurs 
at 100-200 nm; (10 5 -50.000 cm 1 ) quartz UV at 200-350 nm; (50.000-28.570 cm" 1 ) and visible at 350-800 nm (28 570- 
12.500 cm- ) and is described by the Beer-Lambert-Bouguet law. Ultraviolet and visible light are energetic enough to 
promote outer electrons to higher energy levels. UV-vis spectroscopy can be usually applied to molecules and inorganic 
ions or complexes in solution. The uv-vis spectra are limited by the broad features of the spectra. The light source is 
usually a hydrogen or deuterium lamp for uv measurements and a tungsten lamp for visible measurements. The wave- 
lengths of these continuous light sources are selected with a wavelength separator such as a prism or grating mono- 
chromator. Spectra are obtained by scanning the wavelength separator and quantitative measurements can be made 
from a spectrum or at a single wavelength. 

(02221 Mass spectrometers use the difference in the mass-to-charge ratio (m/z) of ionized atoms or molecules to 
separate them from each other. Mass spectrometry is therefore useful for quantitation of atoms or molecules and also 
for determining chemical and structural information about molecules. Molecules have distinctive fragmentation patterns 
that provide structural information to identify compounds. The general operations of a mass spectrometer are as fol- 
lows. Gas-phase ions are created, the ions are separated in space or time based on their mass-to-charge ratio and the 
quantity of ions of each mass-to-charge ratio is measured. The ion separation power of a mass spectrometer is 
described by the resolution, which is defined as R - m / delta m . where m is the ion mass and delta m is the difference 
in mass between two resolvable peaks in a mass spectrum For example, a mass spectrometer with a resolution of 
1000 can resolve an ion with a m/z of 100.0 from an ton with a m/z of 100.1. 

[0223] In general, a mass spectrometer (MS) consists of an ion source, a mass-selective analyzer and an ion 
detector. The magnetic-sector, quadrupole, and time-of-f light designs also require extraction and acceleration ion optics 
to transfer ons from the source region into the mass analyzer. The details of several mass analyzer designs (forraa- 
netic-sector MS. quadrupole MS or time-of-flight MS) are discussed below. Single Focusing analyzers for magnetic- 
sector MS utihze a particle beam path of 180. 90. or 60 degrees. The various forces influencing the particle separate 
ions with different mass-to-charge ratios. With double-focusing analyzers, an electrostatic analyzer is added in this tvoe 
of instrument to separate particles with difference in kinetic energies. 

[0224] A quadrupole mass filter for quadrupole MS consists of tour metal rods arranged in parallel. The applied volt- 
ages affect the trajectory of ions traveling down the flight path centered between the tour rods. For given DC and AC 
voltages, only ions of a certain mass-to-charge ratio pass through the quadrupole filter and all other ions are thrown out 
of thar original path. A mass spectrum is obtained by monitoring the ions passing through the quadrupole filter as the 
voltages on the rods are varied. 

[0225] A time-of-flight mass spectrometer uses the differences in transit time through a "drift region" to separate 
ions of different masses. It operates in a pulsed mode so ions must be produced in pulses and/or extracted in pulses 
A pulsed electric field accelerates all ions into a field-free drift region with a kinetic energy of qV. where q is the ion 
charge and V is the applied voltage. Since the ion kinetic energy is 0.5 mV* lighter ions have a higher velocity than 
heavier ions and reach the detector at the end of the drift region sooner. The output of an ion detector is displayed on 
an oscilloscope as a function of time to produce the mass spectrum. 

[0226] The ion formation process is the starting point for mass spectrometric analyses. Chemical ionization is a 
method that employs a reagent ion to react with the analyte molecules (tags) to form ions by either a proton or hydride 
transfer. The reagent ions are produced by introducing a large excess of methane (relative to the tag) into an electron 
impact (El) .on source. Electron collisions produce CH 4 * and CH 3 * which further react with methane to form CH S * and 
CzHs . Another method to ionize tags is by plasma and glow discharge. Plasma is a hot partially-ionized gas that effec- 
tively excites and ionizes atoms. A glow discharge is a low-pressure plasma maintained between two electrodes Elec- 
tron impact ionization employs an electron beam, usually generated from a tungsten filament to ionize gas-phase 
atoms or molecules. An electron from the beam knocks an electron off analyte atoms or molecules to create ions Elec- 
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trospray 'orwat,on utilizes a very fine needle and a series of skimmers. A sample solution is sprayed into the source 
chamber to form droplets The droplets carry charge when the exit the capilSyand as the sc^va^es LTcS 
lets d.sappear leaving highly charged analyte molecules. ESI is part.cu.ady useL for large bSS^SSlteS 
^1° 8 "r ft F , aSt " at ° m ^"^ent (FAB) utilizes a high-energy beam of SaSTS? 
or Ar that strikes a soW sample causing desorption and ionization. It is used tor large biological moleoJl JSEr. «* 
f icutt to aet into the aas ohasa far im>~ w~ . . f ^ mwecuies mat are dif- 



though a charge-exchange cell. The ions pick up an electron in collisions with neutral atoms to form a^eZ oThS 
energy atoms. Laser ionization (UMS) is a method in which a laser pulse ablates materiaTfrom LiSi^lSI 
» and creates a microplasma that ionizes some of the sample consttuents MaSx^sSed^Jr rt^^? ^ 
(MALOI) is a UMS method of vaporizing and ionizing *Z tSSSZEZ stS, as^ pS^^fTmeT 
The b,ological molecules are dispersed in a solid matrix such aTnicotinic aci? AW tes^outee^S^ 

T * 01 ^ m ° ,eCUleS int ° ,he 9 * S * hase an fonized form so ti^ey « K£S££ Vnt£ 
spectrometer. Plasma^esorption ionization (PD) utilizes the decay of 252 Cf which producestwo fS^S^JT^ 
r travel in opposite directions. One fragment strikes the samp.Tknocking out Til Ta^yS ™ 
2 ™. 3 T"? ^ers the start of data acquisition. Ss ionization" mlod £ ^e^l fof^e b LT 
cal molecule. Resonance ionization (RIMS) is a method in which one or more laser beams aTet™*£ re£™?£ 
traditions of a gas-phase atom or molecule to promote it in a stepwise fashion above^ntzat^^e^^S 
an .on. Secondary .onization (SIMS) utilizes an ion beam; such as W, 1s O* or <°Ar*- is to^ZT, 1 J* 5 
a sample and sputters materia, into the gas phase. Spark source is a method 'Sj££!ZSE£ sot^S* 
by pulsing an electric current across two electrodes. analytes in sold samples 

[02271 A teg may become charged prior to. during or after cleavage from the molecule to which it is attache lo™ 
za^ methods based on ion "desorption". the direct formation or. Mission of ions (Tl c^ sJdfcet'htve 
allowed increasing application to nonvolatile and thermally labile compounds. These memods aH^-TZ J?? 
neutral molecule volatilization prior to ionization and generally rrinin^^^t^ ^Z^Z 
These methodsi .ncfude field desorption (Becky. Principtes of F,eki Ionization and Sid De^mtioVl^^^Z 
Pergamon Oxford. 1977). plasma desorption (Sundqvist and Madarlane. Mass Spec^m n£ 7Sf^T R Z 
^?°? ton (Karas and Hillenkamp. Anal. Chem. 602299. 1988: Karas et al.. AngewChem W1 80& iga^SJn!. 
tide bombardment (e.g.. fast atom bombardment. FAB. and secondary ion maafsDert™!. Jh* 25q S^lT 
Ana,. Chem 54^ 1982). and thermospray (TS, ionization (^mZ^^Z 7^9^ ^J^ 
pray rs broadly applied for the on-line combination with liquid chromatography. TnTconLous ffoVFAB^2h^^" 
no., et a... Anal. Chem. 55:2949. 1986) have also shown significant pZSj A rr»re c^Ze tSSS. SSS^S 
spectrometry combinations is ion-trap mass spectrometry, electrospray ionization nvn^tanS™ 3SS 
spectrometry, liquid ionization mass spectrometry. afrnosSneric pr^i foSSn maS ZlT 
bon mass spectrometry, metastable atom bombardment ionization mass sr^omefr7te2ain« V^JS^ * 
mass spectrometry. MALDI mass spectrometry. . photo-ionization ^^^J^sp^c^ 
spectrometry. MALDI-TOF mass spectrometry. APCI mass spectrometry nano^^m^^SJ^^^ 

s^ n r iZati ° n T* SP8Ctr ° me,ry - i0ni2a «°" ^^oZy^^^S^^^^ 

secondary ionization mass spectrometry, thermospray mass spectrometry. spectrometry. 

[0228] The ionization methods amenable to nonvolatile biological compounds have overianninn ra ™~. 

'-fff: eWidenCie ? W9h,y d6pendent °" compo'sttion .^^SS 
indicate that the upper molecular mass for TS is about 8000 daftons (Jones andKrolik. Ran* T£. 
r :67 1987). Since TS is practiced main.y with quadrapo.e mass s^«^7^ 
tionately at higher mass-charge ratios (nVz). Time-of-ffight (TOF) mass spectrometer SSSSSi SaSL" 
and possess the advantage that the m/z range is limited only by detector efftaency Recentiv twn^ ^i ^ 
methods have been introduced. These two methods are now referred to as ZZS^SSS 
Karas and Hillenkamp. Anal. Chem. 60:2299. 1988; Karas et al.. Anoew Cnem / 0 1MS1<^^ ,?1 ( 
ization (ESI). Both methodologies have very high ionization e^iencyTe ifS^SSUSS? T 
cu.es consumed]). Sensitivity, which defines the ultimate potential of the iJWuJTS^SlS 
quantrty of ions, flow rate, detection efficiency and actual ionization efficiency. ^ 00 Sample s,ze - 

[0229] Electrospray-MS is based on an idea first proposed in the 1960s (Dole et al J Chem Ph^ ao ooa* 
1968). Electrospray ionization (ESI) is one means to produce charged molecules i analysis bvma'J^ 
Briefly, electrospray ionization produces highly charged droplets by nebulizing Ifo^^ 

h.ghlycharged drop.ete. generally formed in a dry bath gas at atmospheric pressure, shrank o! tSSS£ Z 12 
solvent unt.. the charge repulsion overcomes the cohesive forces, leading to a "Coulombic mS£^T^!Sf 
amsrn of .on.zat.on is controversial and several groups have put forth hypotheses (Blades Ana/' ChlT^nt 
4. 1991 Kebarleeta... Ana,. Chem. 65.A972-86. 1993: Fenn. J. Am. Soc. te K S 
less of the ultimate process of ion formation. ESI produces charged molecules from TsolutiTn urS^Sid SSitionT 
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[0230] The ability to obtain useful mass spectral data on small amounts of an organic molecule relies on the effi- 
cient production of ion& The efficiency of ionization for ESI is related to the extent of positive charge associated with 
the molecule. Irrproving ionization experimentally has usually involved using acidic conditions. Another method to 
improve ionization has been to use quaternary amines when possible (see Aebersdd et a!.. Protein Science 1 494-503 
1992; Smith et al.. Anal. Chem. £0:436-41, 1988). 

[0231 J Electrospray ionization is described in more detail as follows. Bectrospray ion production requires two steps: 
dispersal of highly charged droplets at near atmospheric pressure, followed by conditions to induce evaporation. A solu- 
tion of analyte molecules is passed through a needle that is kept at high electric potential At the end of the needle, the 
solution disperses into a mist of small highly charged droplets containing the analyte molecules. The small droplets 
evaporate quickly and by a process of field desorption or residual evaporation, protonated protein molecules are 
released into the gas phase. An electrospray is generally produced by application of a high electric field to a small flow 
of liquid (generally 1-10 uL7min) from a capillary tube. A potential difference of 3-6 kV is typically applied between the 
capillary and counter electrode located 0.2-2 cm away (where Ions, charged clusters, and even charged droplets, 
depending on the extent of desolation, may be sampled by the MS through a small orifice). The electric field results in 
charge accumulation on the liquid surface at the capillary terminus; thus the liquid flow rate, resistivity, and surface ten- 
sion are important factors in droplet production. The high electric field results in disruption of the liquid surface and for- 
mation of highly charged liquid droplets. Positively or negatively charged droplets can be produced depending upon the 
capillary bias. The negative ion mode requires the presence of an electron scavenger such as oxygen to inhibit electri- 
cal discharge. 

[0232] A wide range of liquids can be sprayed electrostatically into a vacuum, or with the aid of a nebulizing agent. 
The use of only electric fields for nebulization leads to some practical restrictions on the range of liquid conductivity and 
dielectric constant Solution conductivity of less than 10 s ohms is required at room tenperature for a stable electro- 
spray at useful liquid flow rates corresponding to an aqueous electrolyte solution of < 10" 4 M. In the mode found most 
useful for ESI-MS. an appropriate liquid flow rate results in dispersion of the liquid as a fine mist. A short distance from 
the capillary the droplet diameter is often quite uniform and on the order of 1 pm. Of particular importance is that the 
total electrospray ion current increases only slightly for higher liquid flow rates. There is evidence tat heating is useful 
for manipulating the electrospray. For example, slight heating allows aqueous solutions to be reacfily eJectrosprayed. 
presumably due to the decreased viscosity and surface tension. Both thermally-assisted and gas-nebulization-assisted 
electrosprays allow higher liquid flow rates to be used, but decrease the extent of droplet charging. The formation of 
molecular ions requires conditions effecting evaporation of the initial droplet population. This can be accomplished at 
higher pressures by a flow of dry gas at moderate temperatures (<60°C). by heating during transport through the inter- 
face, and (particularly in the case of ion trapping methods) by energetic collisions at relatively low pressure, 
[02331 Although the detailed processes underlying ESI remain uncertain, the very small droplets produced by ESI 
appear to allow almost any species carrying a net charge in solution to be transferred to the gas phase after evaporation 
of residual solvent. Mass spectrometry detection then requires that ions have a tractable m/z range (<4000 dartons for 
quadrupole instruments) after desolvation. as well as to be produced and transmitted with sufficient efficiency. The wide 
range of solutes already found to be amenable to ESI-MS. and the lack of substantial dependence of ionization effi- 
ciency upon molecular weight suggest a highly non-discriminating and broadly applicable ionization process. 
[0234] The electrospray ion "source" functions at near atmospheric pressure. The electrospray "source" is typically 
a metal or glass capillary incorporating a method for electrically biasing the liquid solution relative to a counter elec- 
trode. Solutions, typically water -methanol mixtures containing the analyte and often other additives such as acetic acid 
flow to the capillary terminus. An ESI source has been described (Smith et al.. Anal. Chem 52:885. 1990) which can 
accommodate essentially any solvent system. Typical flow rates for ESI are 1-10 uUmin. The principal requirement of 
an ESI-MS interface is to sample and transport ions from the high pressure region into the MS as efficiently as possible. 
[0235] The efficiency of ESI can be very high, providing the basis for extremely sensitive measurements, which is 
useful for the invention described herein. Current instrumental performance can provide a total ion current at the detec- 
tor of about 2 x 10' 12 A or about 10 7 counts/s for singly charged species. On the basis of the instrumental performance 
concentrations of as low as 10' 10 M or about 10" 18 mol/s of a singly charged species will give detectable ion current 
(about 10 countsJte) if the analyte is completely ionized. For example, low attomole detection limits have been obtained 
for quaternary ammonium ions using an ESI interface with capillary zone electrophoresis (Smith et al.. Anal. Chem. 
59:1230, 1988). For a compound of molecular weight of 1000. the average number of charges is 1. the approximate 
number of charge states is 1 . peak width (m/z) is 1 and the maximum intensity (ion/s) is 1 x 10 12 . 
[0236] Remarkably little sample is actually consumed in obtaining an ESI mass spectrum (Smith et al. Anal Chem 
60:1948. 1988). Substantial gains might be also obtained by the use of array detectors with sector instruments, allowing 
simultaneous detection of portions of the spectrum. Since currently only about 10 s of all ions formed by ESI are 
detected, attention to the factors limiting instrument performance may provide a basis for irrproved sensitivity. It will be 
evident to those in the art that the present invention contemplates and accommodates for improvements in ionization 
and detection methodologies. 
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102371 ^ inte ^e is preferably placed between the separation instrumentation (e.g.. gel)and the detector (e a 
mass spectrometer). The interface preferably has the following properties: (1) the abiWyto a£*£\ OwS^^L 
at discreet tan. intervate. (2) concentrate the ONA fragments. (3, remove the DNA fragment™ Ee 2£5E3l 

oTJTn^ T ■ SfSHT *V* 9 fr ° m the DNA "a*™* < 5 > the tag from the DNA^eT^dlspS 

of the DNA fragment. (7) place the tag in a volatile solution. (8) volatilize and ionize the tag. and S^bce a r traiSS 
the tag to an electrospray device that introduces the tag into mass spectrometer. transport 
(0238J The interlace also has the capability of "collecting" DNA fragments as they elute from the bottom of a oel 

JZZ H t T^S* I Sl3b 9e> - a tubular a caP" 13 '* etc. The DNA fragments can be SecCbVsieS 
method* The f.rst method is that of use of an electric field wherein DNA fragments Te collecteTonto or ™ ^2? 

' Z iS that Wherein the DNA ^ents are collected byTowing a £2S£5 ^Te fc^m 

A *^ 8 5 th me1h0dS 030 be combined wherein DNA colected into a flowing streaS whSfca^ 
concentrated by use of an electnc field. The end result is that DNA fragments are removed from the mSieu u^JTLhf* 

I ? 3 ^L _ 0nCe * he DNA fra9mentS are in the appropriate solution (compatible with electrospray and mass sDectram- 
etry) ■» tag car .be cleaved from the DNA fragment The DNA fragment (or remnants •J255E£E S^nZ 

^h^^^Tlr e,eCtriC ,ieW < preferab '* " * «*»»«■ charge oTL of ^Dr^t^S 

£L« 'ntroduced into the electrospray device by the use of an electric field or a flowing liquid 

[02401 Fluorescent tags can be identified and quantitated most directly by their absorption and fluorescence emis 
sion wavelengths and intensities. "uorescence emis- 

(0241 J While a conventional spectrofluorometer is extremely flexible, providing continuous ranges of excitation and 
emission wavelengths (l EX . I S1 . (s2 ). more specialized instruments such as flow cytometers J^Zm^Z^Z 

Stersl^^^^ 

of *2 I* XL? TrT *TZ5l " ^ tOWart * nigh-intensity illumination! Z^eve^e^eSu^ 

nil, ^ UOr ° Ph ° re (photob,eachin 9J becomes *e factor limiting fluorescence detectabilfty T^pracS^* 
of photobleaching depends on the fluorescent detection technique in question 9 pract,caJ ,mpact 

Son ^"^^11° ° n ^ n tHe *■* " device (an intertice > ^ be inter P<*«* between the separation and 
detection steps to permit the continuous operation of size separation and tag detection (in real time) Thfeuni^^ 

separation methodology andinstrumentation with the detection methodology -ESSEEl^ TstioE 

(02441 The function of the interface is primarily the release of the (eg., mass spectrometry) tao from analvta tw q 

h^ 8 *? T 65 ,^ 8 imp,emeritations <* <"« '"t^ace. The design^ the W^b?S3^T!SSi2?3 
cleavaWe linkers. In the case of light or photo-cleavable linkers, an energy or photon source is reouiVed totoTZtl 2 
an add-labfle linker, a base-labile linker, or a disulfide linker, reagent add-on 

of heat-.ab.te linkers, an energy heat source is required. Enzyme addition is required wSSfiJ ,S£ 
such as aspect protease and a peptide linker, a nuclease and a DNA or RNA linker, a glycc^ase^P ™ 

2 ata !! T 8 " nk6r - ** iS UnStaWe after c,eava 9 e < e -9- simifiar to chemiluminescem subsS^ther characST 
.sties of the interface include minimal band broadening, separation of DNA from tags before Hecta Tn£ a 
spectrometer. Separation techniques include those based on e.ectrophoretic methods an^nSeT Sity tiST 
niques. size retention (dialysis), titration and the like. lecnniques. affinity tech- 

(02451 it is also possible to concentrate the tags (or nucleic acid-linker-tag construct), capture electroohoreticallv 
and then release into alternate reagent stream which is compatible with the partial ^Ttf toSon^S 
selected. The interface may also be capable of capturing the tags (or nucleic aM Un^g^^TrZZ^ 
shooting the bead(s) into chamber and then preforming laser desorptionA,aporization. S^TpSS 
«ow ,nto alternate buffer (e g from capillary electrophoresis buffer into hydrophobic buffer across a^meabTml 
brane). ft may also be des,rable in some uses to deliver tags into the mass spectrometer intermittenttTwhth 3d 
comprise a further function of the interface. Another function of the interface is to deliver tags from coluWSS 

^^cT"- WitH 3 TOta1in9 timS S,0t ** 6aCh CO,Umn Afeo ' rt is P«W to deliver tags KvJSSSS 
Se^rometer ^ ^ "* Set ° f * QS ^ 3 f6W ^thi del'ert a ta« 

[02461 The following is a list of representative vendors for separation and detection technologies which mav be 
used in the present invention. Hoefer Scientific Instruments (San Francisco. CA) manufactures eleX^^rS^ 
ment (Two Step™. Poker Face™ II) for sequencing applications. Pharmacia BiS^ti^^aScTres 
e.ectrophores.s equ^ment for DNA separations and sequencing (PhastSystem for PCR-SSCP ana^sis? Maao^nS 
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System for DNA sequencing). Perkin Elmer/Applied Biosystems Division (ABI, Foster City. CA) manufactures semi- 
automated sequencers based on fluorescent-dyes (ABI373 and ABI377). Analytical Spectral Devices (Boulder, CO) 
manufactures UV spectrometers. Hitachi Instruments (Tokyo, Japan) manufactures Atomic Absorption spectrometers. 
Fluorescence spectrometers. LC and GC Mass Spectrometers. NMR spectrometers, and UV-VIS Spectrometers. 
PerSepth/e Biosystems (Framingham. MA) produces Mass Spectrometers (Voyager™ Elite). Bruker Instruments Inc. 
(Manning Park. MA) manufactures FTIR Spectrometers (Vector 22). FT- Raman Spectrometers. Time of Flight Mass 
Spectrometers (Reflex II™). Ion Trap Mass Spectrometer (Esquire™) and a Maldi Mass Spectrometer. Analytical Tech- 
nology Inc. (ATI. Boston, MA) makes Capillary Gel Electrophoresis units. UV detectors, and Diode Array Detectors. Tel- 
edyne Electronic Technologies (Mountain View, CA) manufactures an Ion Trap Mass Spectrometer (3DQ Discovery™ 
and the 3DQ Apogee™). Perkin Elmer/Applied Biosystems Division (Foster City. CA) manufactures a Sciex Mass Spec- 
trometer (triple quadruple LC/MS/MS, the AP1 100/300) which is compatible with eJectrospray. Hewlett-Packard (Santa 
Clara, CA) produces Mass Selective Detectors (HP 5972A). MALDI-TOF Mass Spectrometers (HP G2025A). Diode 
Array Detectors, CE units. HPLC units (HP1090) as well as UV Spectrometers. Fmnigan Corporation (San Jose, CA) 
manufactures mass spectrometers (magnetic sector (MAT 95 S™). quadrapole spectrometers (MAT 95 SQ™) and four 
other related mass spectrometers). Rainin (Emeryville. CA) manufactures HPLC instruments. 

[0247] The methods and compositions described herein permit the use of cleaved tags to serve as maps to partic- 
ular sample type and nucleotide identity. At the beginning of each sequencing method, a particular (selected) primer is 
assigned a particular unique tag. The tags map to either a sample type, a dideoxy terminator type (in the case of a 
Sanger sequencing reaction) or preferably both. Specifically, the tag maps to a primer type which in turn maps to a vec- 
tor type which in turn maps to a sample identity. The tag may also may map to a dideoxy terminator type (ddTTP. ddCTP, 
ddGTR ddATP) by reference into which dkfeoxynudeotide reaction the tagged primer is placed. The sequencing reac- 
tion is then performed and the resulting fragments are sequentially separated by size in time. 

[0248] The tags are cleaved from the fragments in a temporal frame and measured and recorded in a temporal 
frame. The sequence is constructed by comparing the tag map to the temporal frame. That is. all tag identities are 
recorded in time after the sizing step and related become related to one another in a temporal frame. The sizing step 
separates the nucleic acid fragments by a one nucleotide increment and hence the related tag identities are separated 
by a one nucleotide increment By foreknowledge of the dtdeoxy-terminator or nucleotide map and sample type, the 
sequence is readily deduced in a linear fashion. 

[0249] The following examples are offered by way of illustration, and not byway of limitation. 
[0250] Unless otherwise stated, chemicals as used in the examples may be obtained from Aldrich Chemical Com- 
pany. Milwaukee, Wl. The following abbreviations, with the indicated meanings, are used herein: 

ANP » 3-(Fmoc-amino)-3-<2-nrtrophenyl)propionic acid 
NBA =» 4-(Fmoc-aminomethyl)-3-nrtrobenzoic acid 

HATU m 0-7-azabenzotriazol-1-yl-N f N.N , ,N , -tetramethyluronium hexafluorophosphate 
OIEA = diisopropylethylamine 
MCT = monochlorotriazine 
NMM » 4-methylmorpholine 
NMP = N~metfiy!pyrrolidone 

ACT357 ■ ACT357 peptide synthesizer from Advanced ChemTech, Inc.. Louisville, KY 
ACT = Advanced ChemTech, Inc., Louisville, KY 

NovaBiochem = CalBiochem-NovaBiochem International. San Diego. CA 

TFA = Trrf luoroacetic acid 

Tfa = Trif luoroacetyl 

iNIP = N-Methylisonipecotic acid 

Tfp = Tetrafluoropherryl 

DIAEA = 2-(Diisopropy!amino)ethylamine 
MCT = monochlorotriazene 

S'-AH-ODN m S'-aminohexyl-tailed ofigodeoxynucleotide 
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EXAMPLES 



EXAMPLE 1 



PREPARATION OF ACID LABILE LINKERS FOR USE IN CLEAVABLE-M W- IDENTIFIER SEQUENCING 

T^j^ ""** 1 * FstPrs * rhPmica " Y ( ***»' M ^ SBgansggy lags to ■ ^ lags ^ n nr . 



[0251] Figure 1 shows the reaction scheme. 



SieaA TentaGel S AC resin (compound II; available from ACT; 1 eq.) is suspended with Dmp in th« 

££ S^^^SST r, 1 ^^ 8 is remWed resin ^sheo Sth nS?(2^ 

(2X) - ^ DMF (2X) ' The ^""S «* 1 «o the resin and the wash steps are repeated, to give compound li?' 

S!e£_S. The resin (compound III) is mixed with 25% pioeridine in DMF and ^t-wc-; -n. . . 

Sje&£. The deprotected resin from step B is suspended in DMF and to it is added an FMOC-nrot^-H am - ^ 
containing amine functionality in its side chain (compound IV e q atoha N Tfm?^!SS^T aa * 
from Synthetech. Afoany. OR; 3 eq,. HATU (3 «,)^EA (7.5 2ft S ^ 

a^^rashsta^L and rSS » n l" 88 "** ""^ NMP ^ 2X )> WeOH (2XXand DMF (2X) l^^coi^ig c^rJto the resin 
and the wash steps are repeated, to give compound V. ***™»g o» iv to me resin 

S1SLT2 The resin (compound V) is treated with pipeline as described in step B to remove the FMOO mn t>,» 
deprotected res,n is men divided equaJly by the ACT357 from the collection vessel inro^ re^c^S^ 9 

*e shaKen for 1 hr. The solvent is SlSSSSSK ifi^SSX -^^^^2^^ 
(2X). The coupling of V,, 16 to the allots of resin and the wash steps are tS^^S^'S^ 

individual tubes. The aliquots of resin are washed with CH,CI 2 (2 J) WMeOH fexfaTmt hSST ^ 
the individual tubes. The individual tubes are evaporatedTv^ JSSSSS^^,^ 

Each of the free carboxylic acids Villus is dissolved in DMF. To each solution is added pyridine m oseot 
flowed by pentaf luorophenyl trrf luoroacetate (1 . 1 eq.). The mixtures are stirred for Jmtoat SSJES^LX' 
The solutions are diluted with EtOAc. washed with 1 M aq. citric acid (3X) and 5% ao N^cn^TJf ^1 
Na 2 S0 4 . filtered, and evaporated in vacuo, providing compounds IX, ,6 3 ^ Wef 

■ - iry?? 91 Penta,luoro P nen y Fstere <* Chemically Cteavhto M acc Soectrn^opy Ta gs . ;K rrtfn T . 
boxvl Aeiri Tgi-m^ 661 w . a» uvaaw i r 

[0252] Figure 2 shows the reaction scheme. 

Siea_A. 4-(Hydroxymethyt)phenoxybutyric acid (compound I; 1 eq.) is combined with DIE A ,„,,. 
m,de (2.1 eq.) in CHC 3 and heated to reflux for 2 hr. The mixture* dilut* vS^S* ?S£E S IS 

sssr e buffer (2X) - and br,ne (ix) - dried — Na ^- *<* stis^i^ 

a^on^a^ 

able from Synthetech. Albany. OR; 1 eq.). N-methylLX^ ^eqjana STSnSSSSTl ^ 
temperature for 4 hr. The mixture is diluted w*h CH 2 C», washed with 1 M ag. \£t wafer " S££ 
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aq. NaHC0 3 (2X). dried over Na 2 S0 4 , and evaporated in vacuo. Compound III is isolated by flash chromatography 

(CH 2 Cl2-> EtOAc). 

Step C. Compound III is dissolved in CH 2 CI 2 . Pd(PPh3) 4 (0.07 eq.) and N-methylaniline (2 eq.) are added, and the 
mixture stirred at room temperature for 4 hr. The mixture is diluted with CH 2 CI 2 . washed with 1 M aq. citric acid (2X) 
and water (1X). dried over Na^O* and evaporated in vacuo. Compound IV is isolated by flash chromatography 

(CH 2 Cl2-» EtOAc + HOAc). 

Step P. TerrtaGel S AC resin (compound V; 1 eq.) is suspended with DMF in the collection vessel of the ACT357 
peptide synthesizer (Advanced ChemTech Inc. (ACT), Louisville, KY). Compound IV (3 eq.), HATU (3 eq.) and 
DIEA (7.5 eq.) in DMF are added and the collection vessel shaken for t hr. The solvent is removed and the resin 
washed with NMP (2X). MeOH (2X), and DMF (2X). The coupling of IV to the resin and the wash steps are 
repeated, to give compound VI. 

Step 5 The resin (compound VI) is mixed with 25% pipericfine in DMF and shaken for 5 min. The resin is filtered, 
then mixed with 25% pipericfine in DMF and shaken for 10 min. The solvent is removed and the resin washed with 
NMP (2X), MeOH (2X), and DMF (2X). The deprotected resin is then divided equally by the ACT357 from the col- 
lection vessel into 16 reaction vessels. 

SlfitLE The 16 aliquots of deprotected resin from step E are suspended in DMF To each reaction vessel is added 
the appropriate cartxwylic acid VII^q (R^gCO^; 3 eq.). HATU (3 eq.). and DIEA (7.5 eq.) in DMF. The vessels 
are shaken for 1 hr. The solvent is removed and the aliquots of resin washed with NMP (2X). MeOH (2X). and DMF 
(2X). The coupling of Vll t . 16 to the aliquots of resin and the wash steps are repeated, to give compounds VIIf 1 . 16 . 

Step Q. The aliquots of resin (compounds Villus) are washed with CH2CI2 (3X). To each of the reaction vessels 
is added 1% TFA in CH2CI2 and the vessels shaken for 30 min. The solvent is filtered from the reaction vessels into 
individual tubes. The aliquots of resin are washed with CH 2 Cl2 (2X) and MeOH (2X) and the filtrates combined into 
the individual tubes. The individual tubes are evaporated in vacuo, providing corrpounds IX t . 16 . 

Stop H» Each of the free carboxylic acids IX t . 16 is dissolved in DMF. To each solution is added pyridine (1 .05 eq ) 
followed by pentafluorophenyl trifluoroacetate (1.1 eq.). The mixtures are stirred for 45 min. at room temperature 
The solutions are diluted with EtOAc, washed with 1 M aq. citric acid (3X) and 5% aq. NaHCOg (3X), dried over 
Na 2 S0 4 , filtered, and evaporated in vacuo , providing compounds Xt. 16 . 

EXAMPLE 2 

DEMONSTRATION OF PHOTOLYT1C CLEAVAGE OF TLX 

[0253] A T-L-X compound as prepared in Example 13 was irradiated with near-UV light for 7 mm at room tempera- 
ture. A Rayonett fluorescence UV lamp (Southern New England Ultraviolet Co.. Middletown. CT) with an emission peak 
at 350 nm is used as a source of UV light. The lamp is placed at a 15-cm distance from the Petri dishes with sanples. 
SDS gel electrophoresis shows that >85% of the conjugate is cleaved under these conditions. 

gXAMPLE 3 

PREPARATION OF FLUORESCENT LABELED PRIMERS AND DEMONSTRATION OF CLEAVAGE OF FLUORO- 
PHORE 

Synthesis and Purification of Oligonucleotides 

[0254] The oligonucleotides (ODNs) are prepared on automated DNA synthesizers using the standard phospho- 
ranuldite chemistry supplied by the vendor, or the H-phosphonate chemistry (Glenn Research Sterling. VA). Appropri- 
ately blocked dA. dG. dC. and T phosphoramidites are commercially available in these forms, and synthetic nucleosides 
may readily be converted to the appropriate form. The oligonucleotides are prepared using the standard phosphora- 
midite supplied by the vendor, or the H-phosphonate chemistry. Oligonucleotides are purified by adaptations of stand- 
ard methods. Oligonucleotides with SMrityl groups are chromatographed on HPLC using a 12 micrometer. 300 # Rainin 
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(Emeryville. CA) Dynamax C-8 4.2x250 mm reverse phase column using a gradient of 15% to ss*. MoTm in n , m 
Et 3 NH*OAc- pH 7.0. over 20 min. When deflation is performed, the tM^JS^ll^^!^^ £L 

StSTOiSJStSSE** - - 0,i9 ™ es « ^-ssit 

[0255] Preparation of 2.4,6-trichiorotriazine derived olbonudeotides- mfn mnn .^rrf «• *< , ... 

[02561 The activated purified oligonucleotide is then reacted with a ^00-fold mola^cSsT^Sn^ « « 
sod.um borate at pH 8.3 for 1 hour at room temperature. The unreacted cjaET^SS ot^^SuU £ i 

Texas suitoTcTto" e ( Mo^C pfc^ E^e <S S ^ T ^ ( *> ^ ^ * 
ride ( Mo,ecu JZbes. Eugene. Of^E^^ 

tions consist of 0.15 M sodium borate at oH 8 3 fnr 1 ho. .r r f ^OTirocyanate. me final reaction concfi- 

minutes at room temperature. Ruorescence is measured in a black microliter ^ tvTLT^T inCUbaled ** 15 
incubation tubes (150 microliters, and placed in a black SSEE STJKE SoratS ^Z, A 7^ 
Plates are then read directly using a Fluoroskan I. fluorometer (Row UtLESm ESTlS 2^ 11" 
wavetength of 495 nm and monitoring emission at 520 nm for fluorescein. usin^c^^ 
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40 



I Moles of Fluorochrome 


I RFU non-cleaved 


RFU cleaved 


RFU free 


10 x 10 5 M 


J sa 


1200 


1345 


3.3 x 10 6 M 


2.4 


451 


456 


1.1x10 6 M 


0.9 


135 


130 

• 


3.7x10 7 M 


0.3 


44 


48 


I 12x10 7 M 


0.12 


15.3 


16.0 


I 4.1 x 10 7 M 


0.14 


4.9 


5.1 


14x10 8 M 


0.13 


2.5 


2.8 


4.5x10 9 M 


0.12 


0.8 


0.9 



45 Sd JS^SST - " ^ jS *"* 3 ^ inCreaSe ^ ^ f,UOreSCeTOe the fluorochrome 
EXAMPLE 4 
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PREPARATION OF TAGGED M13 SEQUENCE PRIMERS AND DEMONSTRATION OF CLEAVAGE OF TAGS 
[0260] The activated purified oligonucleotide is then reacted with a 100-fold molar excess of cystamine in 0.15 M 
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allowed to polymerize for at least 30 minutes. Prior to loading, the tape around the bottom of the gel and the well-form- 
ing comb is removed. A vertical electrophoresis apparatus is then assembled by clamping the upper and lower buffer 
chambers to the gel plates, and adding 1 X MTBE electrophoresis buffer to the chambers. Sample wells are flushed with 
a syringe containing running buffer, and immediately prior to loading each sample, the well is flushed with running buffer 
using gel loading tips to remove urea. One to two microliters of sample is loaded into each well using a Pipetteman 
(Rainin. Emeryville. CA) with gel-loading tips, and then electrophoresed according the following guidelines (during elec- 
trophoresis, the gel is cooled with a fan): 





termination reaction polyacryla- 
mide gel 


electrophoresis conditions 


short 

long 

long 


5%. 0.15 mm x 50 cm x 20 cm 
4%, 0.15 mm x 70 cm x 20 cm 
4%. 0.15 mm x 70 cm x 20 cm 


2.25 hours at 22 mA 
8-9 hours at 15 mA 
20-24 hours at 15 mA 



[0307] Each base-specific sequencing reaction terminated (with the short termination) mix is loaded onto a 0.15 
mm x 50 cm x 20 cm denaturing 5% polyacrytamide gel; reactions terminated with the long termination mix typically are 
divided in half and loaded onto two 0.15 mm x 70 cm x 20 cm denaturing 4% poiyacryl amide gels. 
[0308] After electrophoresis, buffer is removed from the wells, the tape is removed, and the gel plates separated. 
The get is transferred to a 40 cm x 20 cm sheet of 3MM Whatman paper, covered with plastic wrap, and dried on a 
Hoefer (San Francisco. CA) gel dryer for 25 minutes at 80°C. The dried gel is exposed to Kodak (New Haven. CT) XRP- 
1 film. Depending on the intensity of the signal and whether the radiolabel is ^P or 35 S. exposure times vary from 4 
hours to several days. Afier exposure, films are developed by processing in developer and fixer solutions, rinsed with 
water, and air dried. The autoradiogram is then placed on a light-box. the sequence is manually read, and the data 
typed into a computer. 

[0309] Taq-polymerase catalyzed cycle sequencing using labeled primers. Each base-specific cyde sequencing 
reaction routinely included approximately 100 or 200 ng isolated single-stranded DNA for A and C or Q and T reactions, 
respectively. Double-stranded cycle sequencing reactions similarly contained approximately 200 or 400 ng of plasmid 
DNA isolated using either the standard alkaline lysis or the diatomaceous earth-modified alkaline lysis procedures. All 
reagents except template DNA are added in one pipetting step from a premix of previously aliquoted stock solutions 
stored at -20°C. Reaction premixes are prepared by combining reaction buffer with the base-specific nucleotide mixes. 
Prior to use. the base-specific reaction premixes are thawed and combined with diluted Taq DNA polymerase and the 
individual end-labeled universal primers to yield the final reaction mixes. Once the above mixes are prepared, four aliq- 
uots of single or double-stranded DNA are pipetted into the bottom of each 0.2 ml thin-walled reaction tube, correspond- 
ing to the A, C. Q. and T reactions, and then an aliquot of the respective reaction mixes is added to the side of each 
tube. These tubes are part of a 96-tube/retainer set tray in a microtiter plate format which fits into a Perkin Elmer Cetus 
Cycler 9600 (Foster City. CA). Strip caps are sealed onto the tube/retainer set and the plate is centrifuged briefly. The 
plate then is placed in the cycler whose heat block had been preheated to 95°C. and the cycling program immediately 
started. The cycling protocol consists of 15-30 cycles of: 95°C denaturation; 55°C annealing; 72°C extension; 35°C 
denaturation; 72°C extension; 95°C denaturation. and 72°C extension, linked to a 4°C final soak file. 
[031 0] At this stage, the reactions may be frozen and stored at -20°C for up to several days. Prior to pooling and 
precipitation, the plate is centrifuged briefly to reclaim condensation. The primer and base-specific reactions are pooled 
into ethanoi. and the precipitated DNA is collected by centrifugation and dried. These sequencing reactions could be 
stored for several days at -20°C. 

[0311] The protocol for the sequencing reactions is as follows. For A and C reactions. 1 pJ, and for G and T reac- 
tions. 2 ^l of each DNA sample (100 ng/ul for M13 templates and 200 ng/ul for pUC templates) are pipetted into the 
bottom of the 0.2 ml thin-walled reaction tubes. AmpliTaq polymerase (N801-0060) is from PerWn-Elmer Cetus (Foster 
City, CA). 

[0312] A mix of 30 .ul AmpliTaq (5U/ M I). 30 M l 5X Taq reaction buffer. 130 jil ddH20. and 190 nl diluted Taq for 24 
clones is prepared. 

[0313] A. C. G, and T base specific mixes are prepared by adding base-specific primer and diluted Taq to each of 
the base specific nucleotide/buffer premixes: 



A.C/G.T 
60/120 



5X Taq cycle sequencing mix 
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30/60 mI 
120/240 mI 
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diluted Taq polymerase 

respective fluorescent end-labeled primer 



* 




recenB, observed la, the nested lnwJt!*E£ZZ^£S5EZ f^**™"""" ,eaata " « has bMn 

f»« •» *™<« « -coons CSSSSSESSi^SKSa frtn^e?" 
lions requite one reaction tubs tor each of the tour terminators Th. rnu^^^l ^* ac " 

ond. 60»C for 4 minutes, and then link to a 4°C hold) ? 5 SeCOnd8, 50 C 1 sec * 

Sep cotomns (Amiccn. Beverly. MA) or G-50 12^^^^ -*» ^ 

C. Termini 



20 



25 



30 



35 



40 



45 
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E. ThCc™^^^ 0- -teria. to set-, to the bottom of ^ 

mix. Allow the gel to hydrate for at lea^Mminutl ^^tl^ * U ™ *"* * S<V6ra ' fimes to 

Altow columns mat havTbeen stSed^C ZlTnr toZZltST 8 ^ ^ ^ a ,e " ^ at 
ing the column and allowing the gel to setUB R^e^^^ ^n^T * V ™ bubb ' eS by invert - 

the column to drain completely, by gravity O^^Tl^Z^.- ''^andthen remove the tower-end cap. Allow 
with a pipet bulb.) InseTlhe c^^tne^Jh ^SSS^S^!^ ^Of^P^e to the column 
tor 2 minutes to remove the fluid. ReZe^u™ Z 5? ^S^t^T^l microce "*^^« at 1300 xg 
Carefully remove the reaction mixture (20 uD armload rt oTtToTS ITJ^JThV . I**™** CO " ection tube - 
cycling instrument that required overlaying wrthS mJiZ ^ITl^t J? f * Samp>eS Were in «*ated in a 

same orientation as it was* forthefirs7s£ ?KSJSSSIS ^J^^SS*' P,adn9 C °' Umn in *• 
desired, reactions can be precipitated witiTethano. certortoge. Do not apply heat or over dry. If 

D. Terminator Reaction Clean-Up vi a $epha dex Filled Mirror Fn, mat nttaf Platoc 

Place microtiter fitter plate on top of a SiT tol^S S£l22 T °' ^ I** 

trifogation. Spin at 1500 rpm for 2 minutes D^^I^Ti* ^? T ^ do no « apart during cen- 

microtter fitter plate on top'of a ^rottter ^.^^ 

gafoa Add an additional 100-200 uJ of Sephadex 050 to fill the mi Jotter ptate" ells ZvSS^^T" 

[0318J Sequenase"' fUSS, C/eve/and OH) catalyzed sequencing w ith labeled terminators Si™.* ^«h^ . • 
nator reactions require approximately 2 M g of phenol extracted M1 3-tsed te^ZTZ 

the primer annealed by incubating DNA. primer and buffer at fif:o r Affar ^ ^- A ,s denatur ed and 

alpha-thto^eoxynucleot^es. labeled terminators! Sd^ed Se^^^ 

ture ,s incubated at 37'C. The reaction is stopped by adding ammonium ^acetate arSe^aS Frt 2£KE!^ 
are Precp-tated and dried. To aid in the removal of unincorporated terminators. lheb N Apellet TSm EElSSf 
anol. The dried sequencing reactions could be stored up to several days at -20»C 

[0319] Double-stranded terminator reactions required approximately 5 ug of diatomaceous earth modified-a.ka.ine 
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lysis midi-prep purified plasmid DNA. The double-stranded DNA is denatured by incubating the DNA in sodium hydrox- 
ide at 65°C, and after incubation, primer is added and the reaction is neutralized by adding an acid-buffer Reaction 
buffer, alpha-thio-deoxynucleotides. labeled dye-terminators, and diluted Sequenase TM DNA polymerase then are 
added and the reaction is incubated at 37°C. Ammonium acetate is added to stop the reaction and the DNA fraaments 
are precipitated, rinsed, dried, and stored. 
[0320] For Single-stranded reactions: 
Add the following to a 1.5 ml microcentrifuge tube: 

4 pi ss DNA (2 plq) 

4 nJ 0.8 \iM primer 

2 Ml 10x MOPS buffer 

2 mI 1 0x Mn 2 Visocitrate buffer 

12 mI 



[13321] To denature the DNA and anneal the primer, incubate the reaction at 85°C-70-C for 5 minutes Allow the 
reaction to cool at room temperature for 1 5 minutes, and then briefly centrifuge to reclaim condensation To each reac- 
tion, add the following reagents and incubate for 1 0 minutes at 37°C. 

7mI ABI terminator mix (Catalogue No. 401489) 

2 mI diluted Sequenase TM (3.25 U/jd) 

1 Ml 2mM a-S dNTPs 
22 mI 



[0322] The undiluted Sequenase TM (Catalogue No. 70775. United States Biochemicals. Cleveland OH) is 13 U/ul 
and is diluted 1 :4 with USB dilution buffer prior to use. Add 20 M l 9.5 M ammonium acetate and 100 ul 95% ethanol to 
stop the reaction and mix. 

[0323] Precipitate the DNA in an ice-water bath for 10 minutes. Centrifuge for 15 minutes at 10 000 xg in a micro- 
centrifuge at 4°C. Carefully decant the supernatant and rinse the pellet by adding 300 jil of 70-80% ethanol Mix and 
centnfuge again for 15 minutes and carefully decant the supernatant 

[0324] Repeat the rinse step to insure efficient removal of the unincorporated terminators. Dry the DNA for 5-10 

minutes (or until dry) in the Speed- Vac. and store the dried reactions at -20°C. 

[0325] For double-stranded reactions: 

Add the following to a 1 .5 rrd microcentrifuge tube: 

5 mI ds DNA (5 *ig) 
4 mI 1 N NaOH 
3 mI ddHjP 



[0326] Incubate the reaction at 65*C-70*C for 5 minutes, and then briefly centrifuge to reclaim condensation Add 
the following reagents to each reaction, vortex, and briefly centrifuge: 

3 mI 8 pM primer 
9 mI ddH 2 0 

4 mI MOPS-Acid buffer 



[0327] To each reaction, add the following reagents and incubate for 10 minutes at 37°C. 

4 mI 1 0X Mn 2 Visocitrate buffer 

6 mI ABI terminator mi 

2 mI diluted Sequenase TM (3.25 U/mO 

1 Ml 2mM [alpha]-S-dNTPs 

22 mI 



[0328] The undiluted SEQUENASE™ from United States Biochemicals is 13 U/ul and should be diluted 1 4 with 
USB dilution buffer prior to use. Add 60 mI 8 M ammonium acetate and 300 mI 95% ethanol to stop the reaction and vor- 
tex. Precipitate the DNA in an ice-water bath for 1 0 minutes. Centrifuge for 1 5 minutes at 1 0.000 xg in a microcentrifuge 
at 4°C. Carefully decant the supernatant, and rinse the pellet by adding 300 m* ol 80% ethanol. Mix the sample and cen- 
trifuge again for 15 minutes, and carefully decant the supernatant Repeat the rinse step to insure efficient removal of 
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the unincorporated terminators. Dry the DNA for 5-10 minutes (or until dry) in the Speed-Vac 



Sequence gel preparation. 
A8l 373A DNA seouencer 



>horesis. sampje loading, Piprt. 



data collection, and analy^ rm ^ 



[0329J Polyacrylam.de gels for DNA sequencing are prepared as described above, except that the qel mix isfiltered 
poor «o polymenzat.cn. Glass plates are carefully cleaned with hot water, distilled wat* a^StoxT to femo^^SS 
fluorescent contaminants prior to taping. Denaturing 6% polyacryfamide eels are ZST™ ^iT^ 
taped plates and fitted with a 36 we., comb. After piynJSS^Z^^S^r^Z VZfi? ' ? Z 

assembled .nto an ABI sequencer, and the checked by laser-scanning. It baseline alterations Ire obSr^nnt£ 
assoaated Macintosh computer display, the plates are recfeaned. Subsequently the SSSS f JSSSS 
^ff^ is and the ge. is pre-electrophoresed for 10-30 " pT^^^Z^Z 

pooled and dried reaction products are resuspended in formamide>EDTA loading bufl*by Z££?£Z 

2 ^ i 6 * 1 ,S Created witWn 816 ABI data section software on ^^Z^^J^^^ 

the number of samples loaded and the fluorescent-labeled mobility file to use for sequencTdaS or«^ 
cleaning the sample weds with a syringe, the odd-numbered sequencing ^^7^^ ^^^^' Z 
using a micropipets equipped with a f tot-tipped gel-loading t^The gS T^^^^^J^l^ 
the wells are cleaned again and the even numbered samples are loaded I Th^ tESS?^ 
terminators is specified on the ABI 373A CPU T^U^Z^^^'^^^L for dye-pnmers and dye- 

so " wara r , e,ates ""orescent signal detected to the corresponding scan nunSTXi 52- 
mines the sample lane posrtions based on the signal intensities. AfteTtoe lanes are tracked the cn^s^TJ SOL 
for each lane are extracted and processed by baseline subtraction, mobility calculaSn^ert Jdtco^SSnn t£ 
.macorrecjon^er processing, toe sequence data files are transferred to a SlwSLSSJSSS^ 
[0330J Protocol: prepare 8 M urea. 4.75% polyacrylamide gels, as described above, using a 36?w^l con* Prior to 

SfSi^'IlT" 6 ? 961 P ' ateS - ASS9mb,e the »- PW- '"to an AB^A ON/ £££ 
City. CA) so that the tower scan (usually the blue) line corresponds to an intensity value of 800-iorS i2?IEL 2? 
the computer data collection window. If the baseline of four-cS scan .in" 2SSLSSSiLf5t 
aluminum heat distribution plate. Pre-electrophorese the ael far in -in mi™*** D ™ ™! 8 ?' ass P ,at es. Affix the 
Ml of FE to the bottom of each tube, vortex tea S 90°C to rf^JT^JZ? ,0adin9 - 3 

the sample wells wfth electrophoresis buTer usto* a syringe UsTo fS COndensation - 
numbed sample. Pre-eleJophorese toe geffo^ K^SrWrrr 
numbered sample. Begin the electrophoresis (30 W for 10 hours). After data collection the A^so^aTwiSn^!' 

™zzt:t ana,ysis so,tware - **** wm create the ^ - sss sees 



F. Double-stranded s< 



cDNA clpnes containinn I, 



>IWA] 



inchon 



[0331] Double-stranded templates of cDNAs containina lona nnivfA^ trar*e ~ » ♦ 

ercner ^oa; or (au) or (a(J) at the 3 end were designed to 'anchor* the Drimorc anH a iir^ ^ . MM . x 717 " 

diately upstream of the po.y(A, region. Using this'protocoToDS 30O fcpTre^^^ 

r'Sr^JTT^? ^ 11,636 CDNAS ^ determined insert-speclficprime^ X SSSJS 
region. The ability to d.rectly obtan sequence immediately upstream from the poly(A) tail of cDNAs steX SJ 

t»™ POria Z CB t0 lar9e SCa ' e efforts to 9e"erate sequence-tagged sites (STSs) from cDNAs ^ 
2^?ma c ?H Pr ° tOCO, !l aS fo " , r S - Syn * esi2e anchor ^ poly (dT) 17 with anchors of (dA) or (dC) or (dG) at the 3' end 
on a DNA synthesizer and use after purrfication on Oligonucleotide Purification Cartridges (Amicc^ Beveriv MM For 
sequenang with anchored pnmers. denature 5-^g of plasmid DNA in a total volume of £ ^xSJno^iJ Y£L«Z 
hydroxy and 0.16 mM EDTAby incubation at 65-C for 10 minutes. Add the three ^SSffiSil S^Z 

of^each) and ,mmed,a«e.y p.ace the mixture on ice. Neutralize toe so.u«on by addin^S ml 3f 5 M^ntSnTu^icetate. 

[03331 Precipitate the DNA by adding 1 50 pi of cold 95% ethano. and wash toe pellet twice with cold 70* e *h a „oi 
Dry the pellet for 5 minutes and then resuspend in MOPS buffer. Anneal toe primers by neX toe SEtS^J ? m 
utes at 65 a C followed by slow cooling to room temoerature far 1 «; -?n ™r,.*~ aZ~ 9 for 2 mm- 

mm* T7 DNA po^as. and «?Wp SS^r^SZ'^ "*° 
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G cDNA sequencing based on PCR and random shotgun cloning 

[0334] The following is a method for sequencing cloned cDNAs based on PCR amplification, random shotgun clon- 
ing, and automated fluorescent sequencing. This PCR-based approach uses a primer pair between the usual "univer- 
sal- forward and reverse priming sites and the multiple cloning sites of the Stratagene Bluescript vector These two PCR 
pnmers. with the sequence 5-TCGAGGTCGACGGTATCG-3* (Seq. ID Mo. 15) for the forward or -16bs primer and 5 1 - 
GCCGCTCTAGAACTAG TG-3' (Seq. ID No. 16) for the reverse or +19bs primer, may be used to anplify sufficient quan- 
tities of cDNA inserts in the 1 .2 to 3.4 kb size range so that the random shotgun sequencing approach described below 
could be implemented. 

[0335] The following is the protocol. Incubate four 100 pi PCR reactions, each containing approximately 100 ng of 
plasmid ON A, 100 pmoles of each primer. 50 mM KCI. 10 mM Tris-HCI pH 8.5. 1.5 mM MgCI 2 . 0.2 mM of each dNTP 
and 5 units of PE-Cetus Amplttaq in 0.5 ml snap cap tubes tor 25 cycles of 95°C tor 1 minute. 55°C for 1 minute and 
72-C for 2 minutes in a PE-Cetus 48 tube DNA Thermal Cycler. After pooling the tour reactions, the aqueous solution 
containing the PCR product is placed in an nebulizer, brought to 2.0 ml by adding approximately 0.5 to 1.0 ml of glyc- 
erol, and equilibrated at -20°C by placing it in either an isopropyl alcohol/dry ice or saturated aqueous Nad/dry ice bath 
for 10 minutes. The sample is nebulized at -20»C by applying 25 - 30 psi nitrogen pressure for 2.5 min. Following ethanol 
precipitation to concentrate the sheared PCR product, the fragments were blunt ended and phosphorated by incuba- 
tion with the Klenow fragment of E. coli DNA polymerase and T4 polynucleotide kinase as described previously. Frag- 
ments in the 0.4 to 0.7 kb range were obtained by elution from a low melting agarose gel. 

[0336] From the foregoing, it will be appreciated that, although specific embodiments of the invention have been 
described herein for purposes of illustration, various modifications may be made without deviating from the spirit and 
scope of the invention. Accordingly, the invention is not limited except as by the appended claims. 
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SEQUENCE MSTTMr. 

(1) GENERAL [FORMATION: 



(i) APPLICANTS: Van Mess. Jeffrey 

Tabone. John C. 
Howbert. J. Jeffry 
Mulligan. John T. . 

(ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR DETERMINING 
THE SEQUENCE OF NUCLEIC ACID MOLECULES tK ™™NG 

( 1 i i ) NUMBER CF SEQUENCES : 16 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY 

(B) STREET; 6300 Columbia Center. 701 Fifth Avenue 
CO CITY: Seattle 

.■(D) STATE : Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

r 

I 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM : PC-DOS/MS-DOS 

COV SOFTWARE: Patentln Release #1.0. Version #1.30 

(vi ) CURRENT APPLICATION DATA- 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 22- JAN- 1997 

(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: McMasters. David D 

(B) REGISTRATION NUMBER : 33 963 

(O REFERENCE/DOCKET NUMBER: 240052.416 

(ix) TELECOMMUNICATION INFORMATION 
(A) TELEPHONE: (206) 622-4900 
(8) TELEFAX: (206) 682-6031 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 1 : 
TGTAAAACGA CGGCCAGT 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY, linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
TGTAAAACGA CGGCCAGTA 
(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
TGTAAAACGA CGGCCAGTAT 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
TGTAAAACGA CGGCCAGTAT G 
(2) INFORMATION FOR SEQ ID N0:5 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGTAAAACGA CGGCCAGTAT GC 
12) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO. 6: 
TGTAAAACGA CGGCCAGTAT GCA 
(2) INFORMATION FOR SEO 10 NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TGTAAAACGA CGGCCAGTAT GCAT 

(2) INFORMATION FOR SEO ID NO:8: 

(l) SEQUENCE CHARACTERISTICS 
(A) LENGTH: 25 base pairs 
(8) TYPE: nucleic acid 

(C) STRANOEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
TGTAAAACGA CGGCCAGTAT GCATG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 
(0) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
TGTAAAACGA CGGCCACG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(8) TYPE: nucleic acid 
(C) STRANOEDNESS: single 
(0) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
TGTAAAACGA CGGCCAGCG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(1) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 20 base pairs 
(8) TYPE: nucleic acid 
(C) STRANOEONESS : single 
(0) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO.ll: 
TGTAAAACGA CGGCCAGCGT 
(2) INFORMATION FOR SEQ ID NO: 12: 

« 

(i) SEQUENCE CHARACTERISTICS 
(A) LENGTH: 21 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGTAAAACGA CGGCCAGCGT A 
(2) INFORMATION FOR SEQ 10 NO. 13: 

(i) SEQUENCE CHARACTERISTICS" 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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TGTAAAACGA CGGCCAGCGT AC 
(2) INFORMATION FOR SEQ 10 NO: 14. 
(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(8) TYPE nucleic acid 

(C) 5TRAN0EDNESS : single 

(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 N0.14: 
TGTAAAACGA CGGCCAGCGT ACC 
(2) INFORMATION FOR SEQ 10 NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



<xi) SEOUENCE DESCRIPTION: SEQ ID NO: 15: 
TCGAGGTCGA CGGTATCG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 
GCCGC7CTAG AACTAGTG 
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Claims 

1. A method comprising: 

(a) providing DNA fragments, each fragment having deavably attached thereto a mass tao 

(b) separating the tagged fragments on the basis of fragment charae s^,«n5L?i^ * 

(c) placing a charge on the tag and cleavina the iJS^Z^t. ^? T" ^ 8,26 or shape: 
prior to. during, or after deavin^e^^Sg^^'" 601, ^ ^ iS p,aced °" *° ^ 

(d) determining each tag by mass spectrometry. 

2. A method for determining the sequence of a nucleic acid molecule, comprising: 

(a) generating tagged nucleic acid fragments which are complementary to a selected tarnaf n,,-,^ ^ _ 
ecule. wherein a tag is an organic moietv that « ,-nrr^ti.,!. .-J! !l selected target nucleic acid md- 

'5 potentiometry; 9 * 15 corre,ative ^ a Particular nucleotide and detectable by 

(b) separating the tagged fragments on the basis of fragment charae ««■ lon «=i i u ■ 

(c, detecting the tags by potentiometry. and ^^^^^0 IXSSS^ZL 

A method for determining the sequence of a nucleic acid molecule, comprising: 

(a) generating tagged nucleic acid fragments which are complemsntarv to a «i^«^^ . ■ 
ecule. wherein a tag is an organic moietv that ■= complementary to a selected target nucleic acid mol- 

trometry or potentiometry: 

/ corrective wrm a particular nucleotide and detectable by spec- 

(b) separating the tagged fragments on the basis of fragment charge, size or shape- 

(c) cleaving the tags from the tagged fragments: and 

(d) detecting the tags by spectrometry or potentiometry. and therefrom deter,™-,;,*, *.„ . 
acid molecule. y meretrom determining the sequence of the nudeic 

4. A compound of the formula T^-L-X wherein. 



3. 

20 



40 5. A compound of the formula T™-L-X wherein, 

acid; and « ra meiry ana is tertiary amine, quaternary amine or organic 



so 



X is a nucleic add; 

where the compound is not bound to a solid support 



6 ' ^°STS d t a "*5 <0 "* "I" °' daims 4 or 5 wherei " <- « selected from i* l— U— L W l« l« . 

I WherS aCt,n ' C rad,at,on - acid. base, oxidation, reduction enzyme electr^hemi^, *! , ' L 1 

exchange, respectively, cause the ^-containing moiety to be c^^L rZlS^^ZT *** 

55 7. A composition comprising first and second compounds of the formula T™-L hv "X wherein. 

L v is a chemical group which, upon exposure to lioht of selected wauoinnMh , _ 
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and 

X is nucleic acid; 

with the proviso that the variable mass component in the first compound has a mass that is not identical to the 
mass of the variable mass component of the second compound, but the T™ groups in the first and second 
compounds are otherwise identical. 

8. A compound according to any one of claims 4 or 5 wherein L is l>. or a composition comprising compounds 
according to claim 7. wherein L™ has the formula L 1 -L 2 -L 3 and -L 2 -L 3 has the formula: 




NO 



with one carbon atom at positions a. b. c. d or e being substituted with -L 3 -X and optionally one or more of 
positions b. c d or e being substituted with alkyl, alkoxy, fluoride, chloride, hydroxyl. carboxylate or amide- and R 1 
is hydrogen or hydrocarbyl. 

9. A compound or composition according to claim 8 wherein -L 3 -X is located at position a. 

10. A compound, or composition comprising compounds, according to any one of claims 4-9 where the coroound 
comprises the formula: 

T 4 
I 

Amide 

I 

o <W 



R« ° 



wherein 



G is (CH^e wherein a hydrogen on one and only one of the CH 2 groups of each G is replaced with-fCH,) - 
Amide-T 4 ; z/c 



T 2 and T 4 are organic moieties of the formula C 1 . 2 5N 0 . 9 O 0 . 9 S 0 .3P 0 . 3 H a F |J ^ wherein the sum of a, p and 8 is 
sufficient to satisfy the otherwise unsatisfied valencies of the C, N. O, S and P atoms- 



Amide is 



o 
ii 

N-C 
'l 



or 



O 
II 

C 



N 
I 

R 



1 
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R 1 is hydrogen or C,. 10 alkyl; 

c is an integer ranging from 0 to 4 and 

^JSS. ranginQ ,r ° m ,tO50SUChthatWhen ^ ^ater than 1,G,c, An** R> and r» are independ- 
t^ttZSZZ*" C ° mPriSin9 COmPOUndS ' aCC °^ to •* « * 4-9 where me compound 

T 4 
I 

Amide 




or quaternary amine or an organic acid; and m is an inSJer ra^ngfSr foT * '"^ 8 

1Z ^ COmPriS,n9 COmPOUndS ' aCCOrdi ^ to «* <™ - Caims 4-9 where the compound 




wherein T 5 is an organic moiety of the formula C «n„ n q d □ c 
is suff*ent to satis* the otherwise -satis*™ g^f-JJ «■ • 

or quaternary amine or an organic acid; and m is an integer ranging from 0 49 3 t8rt,ary 

3. A composition comprising a plurality of compounds accordino to anv one of riaim* a ~ c „ 

7. wherein no two compounds in the compTtion have J£X2££1^Z£Z * ^"^^ - ctalm 

4. A compound according to claim 5. or a composition according to Cairn 7. wherein X is a DNA sequencing primer. 

5. A compound of the formula (T n,s ) n -L-X wherein. 

T™ is an organic group detectable by mass spectrometry, comprising carbon at least nn« «r h„H,„ 
onde. and optional atoms selected from oxygen, nitrogen, sZr. phospl^us 2552? ° hydr ° 9en 
' S an ° f9an,C 9r ° UP Wh ' Ch a,,0WS 3 Containing moiety to be cleaved from the remainder of the compound. 
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wherein the T^-containing moiety comprises a functional group which supports a single ionized charge state 
when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine or organic 
acid; 

X is a functional group selected from hydroxyl, amino, thiol, carboxylic acid, hatoalkyf. and derivatives thereof 
which either activate or inhibit the activity of the group toward coupling with other moieties; and 
n is a number of T" 8 groups, where n is an integer greater than one. 

1 6. A compound of the formula (T^Jn-L-X wherein. 

T™ is an organic group detectable by mass spectrometry, with the proviso that T™ does not comprise reporter 
groups. 

L is an organicgroup which allows a T™-containing moiety to be cleaved from the remainder of the compound 
wherein the T^-contaimng moiety comprises a functional group which supports a single ionized charge state 
when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine or organic 
acid; * 

X is a nucleic acid; and 

n is a number of T™ groups, where n is an integer greater than one. 

17. A compound of the formula T^'-L-X wherein. 

T™ is an organic group detectable by mass spectrometry with the proviso that T™ does not comprise reoorter 
groups; ^ 

L is an organicgroup which allows a ^-containing moiety to be cleaved from the remainder of the compound 
wherein the T™-containing moiety comprises a functional group which supports a single ionized charge state 
when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine or organic 
acid; and 

X is a molecule of interest (MOI) selected from proteins, peptides, antibodies or antibody fragments receptors 
receptor ligands. members of a ligand pair, cytokines, hormones, oligosaccharides, synthetic organic rW 
cules, and drugs. 

18. A compound according to claims 4. 5. 15. 16 or 1 7 wherein the tag is non-volatile and thermally labile. 
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FIGURE 1 
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FIGURE 2 
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FIGURE 4 
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FIGURE 8 
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